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INSECT CHYMOTRYPSIN AND INHIBITORS THEREOF 



BACKGROUND OF THE INVENTION 
5 FIELD OF THE INVENTION 

The present invention relates generally to a novel chymotrypsin that exhibits resistance to a 
plant serine proteinase inhibitor. More particularly, the present invention provides a 
chymotrypsin which is up-regulated in the gut of Helicoverpa armigera and Helicoverpa 

10 punctigera insect larvae when fed the serine proteinase inhibitors of Nicotiana alata. The 
novel chymotrypsin represents, therefore, a target for the identification of antagonists 
including inhibitors which are proposed to be useful in the control of Helicoverpa spp. 
populations that have become resistant to serine proteinase inhibitors produced in plants. 
The antagonists of the chymotrypsin may be topically applied to the plants or, when in 

15 proteinaceous form, may be produced by genetic means in plant cells. The antagonists may 
act at the level of gene expression or protein activity. 

DESCRIPTION OF THE PRIOR ART 

20 Bibliographic details of the publications referred to in this specification are also collected 
at the end of the description. 

Reference to any prior art in this specification is not, and should not be taken as, an 
acknowledgment or any form of suggestion that this prior art forms part of the common 
25 general knowledge in any country. 

Female reproductive tissues and wounded leaves of the ornamental tobacco, Nicotiana 
alata amass high levels of serine proteinase inhibitors (Pis) for protection against pests and 
pathogens (Atkinson et al., The Plant Cell 5: 203-213, 1993). These 6 kDa Pis accumulate 
30 in the vacuole (Miller et aL, Plant Cell 11: 1499-1508, 1999) and are derived in vivo from 
the post-translational modification of a 40.3kDa precursor protein. The precursor of the PI 
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protein (referred to as "NaPI") is composed of six repeated regions of high sequence 
identity (Figure 1) each with a potential PI reactive site. Processing of the six-repeat 
precursor protein unexpectedly occurs at sites located within, rather than between the 
repeated regions. Complete removal of the linker sequence (Glu-Glu-Lys-Lys-Asn) [SEQ 

5 ID NO:l] contained within each repeated region, generates five contiguous 6 kDa 
inhibitors (CI and T1-T4) and a novel two-chain chymotrypsin inhibitor (C2) formed by 
disulphide bond linkage of N-terminal and C-terminal peptide fragments (Heath et al, 
European Journal of Biochemistry 230(1): 25-257, 1995; Lee et al, Nature Structural 
Biology 6(6): 526-530, 1999). The structures of CI, T1-T4 and C2 have been solved using 

10 'H-NMR techniques (Nielson et al, J. Mol. Biol 242: 231-243, 1994; Nielson et al, 
Biochemistry 34: 14304-14311, 1995; Lee era/., 1999, supra). 

Nicotiana alata also has a second gene related to NaPI that encodes a closely related 
precursor protein with four rather than six repeated domains (Miller et al, Plant Mol. Biol. 

15 42: 329-333, 2000). This precursor is also processed in vivo resulting in the release of three 
contiguous 6 kDa inhibitors (CI, T4 and T5) and the two-chain inhibitor C2 (Figure 1). 
Three of the inhibitors (CI, C2 and T4) are identical to those released from the six-domain 
precursor. Related multidomain precursors have been described for other solanaceous 
plants including N. tabacum (Balandin et al, Plant Mol. Biol. 27: 1197-1204, 1995), N. 

20 glutinosa (Choi et al, Biochim. et Biophys. Acta 1492: 211-215, 2000), L. esculentum 
(Taylor et al, Plant Mol. Biol 23: 1005-1014, 1993) and Capsicum annum (Moura and 
Ryan, Plant Physiol. 126: 289-298, 2001; Antcheva et al, Protein Sci. 10: 2280-2290, 
2001). 

25 Several groups have reported on the affect of serine proteinase inhibitors on the activity of 
the digestive proteases of insects and have suggested that they are produced by plants for 
protection against the damaging affects of insect pests and microorganisms (Ryan, Annu. 
Rev. Phytopathol. 28: 425-449, 1990; Gatehouse et al, In: Plant Genetic Manipulation for 
Crop Protection, Biotech, in Agriculture No. 7, Eds. Gatehouse, Hilder & Boulter, 

30 International U.K., pp. 155-181, 1992). Insects that are specialist feeders on a particular 
host plant are generally resistant to the serine Pis produced by that plant, but are sensitive 
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to Pis produced by non-hosts (Broadway and Villani, Entomol Expo. Appl. 76: 303-312, 
1995; Broadway, J. Insect. Ptysiol 41: 107-116, 1995). There is interest, therefore, in 
transferring genes encoding serine Pis from non-hosts into crop plants to enhance insect 
resistance and to decrease reliance on chemical pesticides. Recently, however, several 

5 groups have reported on the ability of certain insects to change the relative proportions of 
proteolytic enzymes in their midgut following ingestion of high levels of Pis (Broadway, 
1995, supra; Jongsma et al, Proc. Natl Acad. Set USA 92(17): 8041-8045, 1995a). 
Broadway (1995, supra), for example, found that certain lepidopteran insects produce two 
broad classes of trypsin like proteases, one of which is insensitive to Pis from cabbage 

10 leaves. After ingestion of the cabbage Pis the insects increased production of the trypsin 
class not affected by the Pis and thus were able to grow and develop unhindered. Jongsma 
and coworkers (1995, supra) made a similar observation with Spodoptera exigua larvae 
fed on Pis from potato (Potll) and tobacco. The factors that regulate the secretion of these 
proteases under these conditions are not known. 

15 

These studies indicate that Pis specific for only one or two of the range of proteinases in an 
insect gut will be of limited use for long term plant protection. The gene encoding the 
N.alata PI has a potential advantage over other plant Pis for the enhancement of insect 
resistance in transgenic plants. Most plant serine Pis contain only one or two inhibitory 
20 domains, whereas the N. alata PI precursors have four or six (Figure 1). Thus, there is 
potential to engineer the individual domains of the N. alata PI to provide inhibitory activity 
against several proteinases in the insect gut. 

The midgut proteases of several Lepidoptera, Coleoptera and Orthoptera have been 
25 partially characterized. In most Lepidopteran species the endoproteinase activity is due 
primarily to serine proteinases (trypsin, chymotrypsin and/or elastase) and cysteine and 
metalloproteinases are not detectable (Christeller et al, Insect Biochem. Molecul Biol 22: 
735^746, 1992; Terra and Ferreira, Comp. Biochem. Physiol 109: 1-62, 1994; Xu and Qin, 
J. Econ. Entomol 87: 334-338, 1994; Lee and Anstee, Insect. Biochem. Molec. Biol 25: 
30 63-71, 1995a; Johnston et al, Insect Biochem. 21: 389-397, 1991; Johnston et al, Insect 
Biochem. Molec. Biol 25(3): 375-383, 1995). Exopeptidase and leucineaminopeptidase 
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have also been identified (Christeller et al, 1992, supra; Lee and Anstee, Insect Biochem. 
Molec. Biol 25(1): 49-61, 1995b). 

The mechanism of action of Pis on insects is only partially understood. Three responses 
5 have been described:- 

(i) Severe retardation of growth without a decrease in gut proteolytic activity. 
Broadway and Duffey (J. Insect Physiol 32: 673-680, 1986a; Broadway and 
Duffey, J. Insect Physiol 32: 827-833, 1986b) found that insects fed on Pis had 

10 remarkably reduced growth rates that were not associated with a decrease in the 

total proteolytic activity in the gut. Indeed the gut proteolytic activity often 
increased. They suggested that a feedback mechanism was operating that led to 
hyperproduction of proteases, that led in turn to a depletion of essential sulphur 
containing amino acids. This phenomenon has been recorded for other insects after 

15 chronic ingestion of Pis (Burgess et al, Entomol Exp. App. 61: 123-130, 1991; De 

Leo et al, Plant Physiol 118: 997-1004, 1998; Markwick et al, J. Economic 
Entomology 91 {6): 1265-76, 1998). 

(ii) Severe retardation of growth with a decrease in gut proteolytic activity. Broadway 
20 (1995, supra) found that the lepidopteran species, Agrotis ipsilon (black cutworm) 

had reduced growth and delayed pupation after exposure to soybean trypsin 
inhibitor and did not respond by secreting Pi-insensitive proteases. These insects 
had up to a 70% reduction in total gut proteolytic activity. Codling moth larvae 
(LepidopteraiTortricidw) fed on 'elastase inhibitors' were also retarded in growth 
25 and development that was associated with diminished elastase activity in the gut 

(Markwick et al, Journal of Economic Entomology 88(1): 33-39, 1995). 

(iii) No effect on growth - change in the complement of gut proteinases. Some insects 
can compensate for the inhibition of one group of proteinases by inducing a new 

30 proteinase activity. The genomes of lepidopteran insects contain genes for a range 

of serine proteases and insects can modify the expression of specific isozymes to 
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suit dietary components (Bown, et al, Insect Biochem. Molec. Biol. 27: 625-638, 
1997; Broadway, J. Insect Physiol 43(9): 855-874, 1997). Changes in the 
complement of gut trypsins and chymotrypsins have been detected using Northern 
blot analysis on RNA from H. armigera (Bown, et al, 1997, supra; Gatehouse, et 

5 al, Insect Biochem. Molecul Biol 27: 929-944, 1997), H. zea and Agrotis ipsilon 

(Mazumdar-Leighton and Broadway, Insect Biochem. Mol Biol 31: 645-657, 
2001a; Mazumdar-Leighton and Broadway, Insect Biochem. Mol Biol 31:633- 
644, 2001b). Corresponding changes at the protein level have also been observed 
using electrophoretic separation of isozymes for K armigera (Harsulkar, et al, 

10 Plant Physiol 121: 497-506, 1999; Patankar, et al, Insect Biochem. & Mol Biol 

31: 453-464, 2001), Spodoptera frugiperda (Paulillo, et al, J. Econ. Entomol. 
P3:892-896, 2000), H. zea and Trichoplusia ni (Broadway, Arch Insect Biochem. 
Physiol 32(1): 39-53, 1996). Sometimes specific isozymes have been up-regulated, 
and occasionally proteases previously undetected have been observed. 

15 

Recently, Mazumdar-Leighton and Broadway (2001a, supra) demonstrated that the 
production of Pi-insensitive trypsins in H. zea is regulated at the transcriptional level and 
can be abolished using the transcriptional regulator actinomycin. Broadway and colleagues 
examined changes in gut trypsin and chymotrypsin activity after H. zea and Trichoplusia 

20 ni larvae were fed for 48 h on artificial diet containing 1% SBTI (Broadway, 1996, supra). 
Trypsin activity increased after SBTI consumption and protease banding patterns on 
zymograms indicated a change in the relative complement of proteases. The researchers 
showed in vitro that SBTI could inhibit 74% of the trypsin activity in gut extracts from 
control larvae, but only 3% of the gut trypsin activity in larvae that had consumed SBTI. 

25 They suggested the new protease bands (one new band for H. zea and 6 new bands for T. 
ni) on the zymograms may be SBTI-insensitive trypsins or SBTI-insensitive 
chymotrypsins and concluded that the production of these new proteases was enhanced by 
the ingestion of SBTI. Further studies using Northern blot analysis showed that 
consumption of SBTI resulted in transcriptional induction of mRNAs encoding trypsins 

30 and chymotrypsins by H. zea and Agrotis ipsilon (Mazumdar-Leighton and Broadway, 
2001a, supra; Mazumdar-Leighton and Broadway, 2001b, supra), although it was not 
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determined if these proteases were SBTI-insensitive. 

The novel trypsin transcript induced in K zea after ingestion of SBTI was designated 
HzT15 (Mazumdar-Leighton and Broadway, 2001b, supra). Recently, the first insect 

5 digestive enzyme insensitive to several proteinase inhibitors was purified from the gut of 
K zea and corresponds to the protein encoded by HzT15 (Volpicella et al, Eur. J. 
Biochem. 270: 10-19, 2003). The authors identified several differences in charge 
distribution across the surface of the structural model of this Pi-insensitive trypsin relative 
to the PI inhibitable trypsins, but were unable to identify the structural changes that led to 

10 resistance. 

Until recently, chymotrypsins were assumed to contribute relatively little to protein 
digestion in Lepidoptera and consequently most biochemical studies focused on 
characterization of the trypsins. This problem arose due to the initial use of synthetic 

15 substrates that worked well with mammalian chymotrypsins, but not at all or poorly with 
the Lepidopteran enzymes. Lepidopteran chymotrypsins prefer synthetic substrates with at 
least four amino acids to occupy the S1-S4 binding subsites on the enzyme, whereas 
mammalian trypsins are active on shorter substrates with one amino acid that is specific for 
the SI binding subsite. That is, the insect chymotrypsins appear to have an extended 

20 substrate binding site requiring at least four amino acids for efficient catalysis. Recent 
studies have shown that chymotrypsins do respond to PI ingestion and are worthy of more 
detailed investigation. When larvae from H. armigera were fed on diets consisting of either 
potato proteinase inhibitor II, soybean trypsin inhibitor, aprotin (trypsin inhibitor) or potato 
proteinase inhibitor I, levels of chymotrypsin mRNA increased in all cases while trypsin 

25 mRNA decreased (Gatehouse et al, 1997, supra). Other reports also mention upregulation 
of chymotrypsins in preference to trypsins (Bown et al, 1997, supra; Wu et al, Molecular 
Breeding 3: 371-380, 1997). Mazumdar-Leighton and Broadway (2001a, supra) assayed 
chymotrypsin activity in the gut ofK zea larvae and found that SBTI inhibited 95% of the 
chymotrypsin from the gut of control insects but only 35% of activity from the gut of 

30 insects that had prior exposure to SBTI in the diet. 
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Hence, consumption of proteinase inhibitors can lead to a drastic change in the 
complement of gut proteases which allows insects to adapt to the diet and survive. Changes 
in the complement of proteases after exposure to Pis have been detected in insects fed on 
both artificial diets and transgenic plants. The triggers that regulate these changes are still 
5 unknown and the responses vary with the species, the PI and its concentration, and the base 
diet. It is unclear why some inhibitors induce this response and others do not. It is clear, 
however, that some larvae are genetically pre-adapted to Pis, since prior exposure to a 
specific inhibitor is not necessary for an insect to be resistant (Broadway, 1996, supra). 

10 There is a need to identify and investigate novel insect proteinases which are insensitive to 
Pis and to use these to screen for antagonists of the proteinases in order to develop agents 
useful in controlling insect growth, maintenance, development and/or survival. 



WO 2004/094630 



PCT/AU2004/000524 



-8- 

SUMMARY OF THE INVENTION 

Throughout this specification, unless the context requires otherwise, the word "comprise", 
or variations such as "comprises" or "comprising", will be understood to imply the 
5 inclusion of a stated element or integer or group of elements or integers but not the 
exclusion of any other element or integer or group of elements or integers. 

Nucleotide and amino acid sequences are referred to by a sequence identifier number (SEQ 
ID NO:). The SEQ ID NOs: correspond numerically to the sequence identifiers <400>1 
10 (SEQ ID NO:l), <400>2 (SEQ ID NO:2), etc. A summary of the sequence identifiers is 
provided in Table 1. A sequence listing is provided at the end of the specification. 

The present invention provides a novel chymotrypsin from Helicoverpa spp. referred to 
herein as "HpCh5". Reference to "HpChS" includes all variants, derivatives, homologs and 

15 analogs as well as members of a HpCh5 family of chymotrypsins. Examples of variants of 
HpChS include proteinase inhibitor (PI) sensitive forms. Such sensitive forms may carry 
inter alia a substitution of the arginine at position 192 to an asparagine or glutamine. This 
substitution is referred to herein as "R192N/Q" using single amino acid nomenclature or 
"Arg 192 Asn/Gln" using three letter amino acid code. Other derivatives of HpF5 include 

20 the signal sequence of HpF5 . 

The HpChS chymotrypsin is encoded by a nucleotide sequence referred to as "HpF5". 
Again, reference to "HpF5" includes variants, homologs and analogs thereof. The term 
"HpF5" encompasses both a genomic sequence as well as a cDNA sequence. The amino 

25 acid sequence of HpCh5 is set forth in SEQ ID NO:2. The amino acid sequence of the N- 
terminal activation peptide is shown in SEQ ID NO:3. The nucleotide sequence of the 
coding region of HpF5 is set forth in SEQ ID NO:4 with the nucleotide sequence encoding 
the activation peptide is shown in SEQ ID NO:5 and its entire 5 '-3' sequence shown in 
SEQ ID NO:6. HpCh5 is generally characterized by being substantially insensitive to 

30 inhibition by a PI from N. alata. 
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Variants and homologs of HpCh5 include molecules having at least 75% amino acid 
identity to SEQ ID NO:2 after optimal alignment. Variants and homologs of HpF5 include 
nucleotide sequences having at least about 75% similarity to SEQ ID NO:4 or SEQ ID 
NO:6 after optimal alignment or a nucleotide sequence capable of hybridising to SEQ ED 
5 NO:4 or SEQ ED NO:6 or its complementary form under low stringency conditions. 

Accordingly, one aspect of the present invention provides an isolated nucleic acid 
molecule comprising a sequence of nucleotides encoding a chymotrypsin from 
Helicoverpa ssp. or a variant, derivative, homolog or analog of said chymotrypsin, wherein 
10 said chymotrypsin exhibits resistance to a PI from N. alata. 

Another aspect of the present invention provides an isolated chymotrypsin from 
Helicoverpa ssp. wherein said chymotrypsin exhibits resistance to a PI from N. alata or a 
variant, derivative, homolog or analog of said chymotrypsin. 

15 

The present invention provides compounds which inhibit HpChS or its variants and 
homologs or members of the HpCh5 chymotrypsin family or which inhibit expression of 
the HpF5 gene or its variants and homologs. 

20 The compounds may be chemical type compounds such as those sprayed or provided to 
plants or genetic type molecules which may be either topically applied or generated in 
plant cells. The HpChS or HpF5 antagonists may also be a modified form of an existing 
plant PI. 

25 The present invention provides, therefore, methods for inhibiting insect infestation of a 
plant or for retarding insect growth and development by the application or dispersement of 
an antagonist of HpCh5 activity or HpF5 gene expression. 

The antagonists include compounds which bind to and inhibit HpChS as well as antisense 
30 or sense nucleic acid molecules generated by a plant cell and then ingested by an insect. 
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Reference to an "antagonist" includes reference to an inhibitor. 

The present invention further provides genetically modified plants which are engineered to 
produce a HpCh5 or HpF5 antagonist. Reference to a "plant" includes a monocotyledonous 
5 or dicotyledonous plant and may be a plant regenerated from genetically transformed 
callus or tissue or progeny of such a plant. The present invention further provides seeds 
and other reproductive material from the genetically modified plants of the present 
invention. 

10 Plants contemplated herein include cotton, sweet corn, tomato, tobacco, piniento, potato, 
sunflower, citrus, plums, sorghum, leeks, soybean, alfalfa, beans, pidgeon peas, chick peas, 
artichokes, curcurbits, lettuce, Dianthus (an ornamental plant) and geraniums, cape 
gooseberry, maize, flax and linseed, alfalfa, lupins, broad beans, garden peas, peanuts, 
canoia, snapdragons, cherry, sunflower, pot marigolds, Helichrysum (an ornamental plant), 

1 5 wheat, barley, oats, triticale, carrots, onions orchids, roses and/or petunias. 

The present invention further provides nucleic acid molecules which encode potato- 
derived protenase inhibitors such as but not limited to Potl A and PotlB or their homologs 
or derivatives as well as transgenic plants comprising and capable of expressing same. 

20 

Yet another aspect of the present invention contemplates a method for screening for an 
inhibitor of an insect chymotrypsin which is insensitive to inhibition by NaPI such as 
HpCh5. Such a method generally involves testing for chymotryptic activity in the 
presence of potential inhibitors. The assay is conveniently contacted in vitro although the 
25 use of H. argmigera and/or H. punctiga is also encompassed by the present invention. An 
isolated inhibitor identified by the subject assay is also contemplated by the present 
invention. 
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A summary of sequence identifiers used throughout the subject specification is provided in 
Table 1. 



TABLE 1 

5 Summary of sequence identifiers 



" 'SECJUENfcE n) NO: ; : 


iv:-:;;/'-.'. •.,:.^fe-''.-.-.nvIc -i-:.je 


1 


linker sequence 


2 


Amino acid sequence of HpCh5 [Figure 12] 


3 


Amino acid sequence of the activation peptide of 
HpCh5 [Figure 12] 


4 


Nucleotide sequence of coding region of mature 
chymotrypsin domain of HpF5 [Figure 12] 


5 


Nucleotide sequence of activation peptide of HpF5 
[Figure 12] 


6 


Nucleotide sequence encoding activation peptide and 
HpCh5 mature chymotrypsin domain together with 3' 
UTR [Figure 12] 


7 


BamUl oligonucleotide primer 


8 


Hindll! oligonucleotide primer 


9 


N-terminal sequence of NaPI-insensitive chymotrypsin 
HpCh5 [Table 7, Figure 11B1 


10 


Fw2ResChy pnmer [lable /, figure llrJJ 


11 


FwResChym primer [Table 7, Figure 1 IB] 


12 


Hc35PQE-60-Fw primer 


13 


Hc35PQE-60-Rv primer 


14 


gene specific sense primer 


! 15 


gene specific antisense primer 


16 


StPotIA sense primer 


17 


StPotEB sense primer 


18 


StPotlA/B antisense primer 


19 


FWBacRECHl (5'-3') primer 


20 


FWBacRECH2 (5'-3') primer 


21 


RvRECH (3'-5') primer | 
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DESCRIPTION 


22 


N-terminal amino acid sequence of six domain PI 
precursor from N. a/a/a [Figure 1C] 


23 


Amino acid sequence of CI peptide from six domain PI 
precursor from N. alata [Figure 1C] 


24 


Amino acid sequence of Tl peptide from six domain PI 
precursor from N. alata [Figure 1C] 


25 


Amino acid sequence of T2 and T3 peptides from six 
domain PI precursor from N. alata [Figure 1C] 


26 


Amino acid sequence of T4 peptide from six domain PI 
precursor from N. alata [Figure 1C] 


27 


C-terminal amino acid sequence of six domain PI 
precursor from N. alata [Figure 1C] 


28-31 


Amino acid sequence of peptide fragment of 
Helicoverpa punctigera chymotrypsin [Figure 5B] 


32 


Amino acid sequence of chymotrypsin from H. 
armigera (CAA72966) [Figure 7] 


33 


Amino acid sequence of chymotrypsin from H. 
armizera (CAA72959) [Figure 7] 


JH 


Aminn anH ^prnience of chvmotrvDsin from H. 
armizera (CAA72960) [Figure 7] 


35 


Amino acid sequence of chymotrypsin from K 
armieera (CAA1295KS IFieure 71 


36 


Amino acid sequence of chymotrypsin from H. 
armizera (CAA72952) [Figure 7] 


17 
J I 


Amino acid seauence of chvmotrvDsin from H. 
armizera (CAA72951) [Figure 7] 


jo 


r WCtI pnmer [rigure oj 


39 


1 RVG4 primer [Figure 8] 


40 


Y79Fw primer [Figure 8] 


41 


Y72Fw primer [Figure 8] 


4/ 


Y72Rv pnmer [Figure 8] 


43 


Amino acid sequence of H. punctigera chymotrypsin 
(Fl Apcr) [Figure 9] 


44 


Amino acid sequence of K punctigera chymotrypsin 
CFlBpcr) [Figure 9] 


45 


Amino acid sequence of K punctigera chymotrypsin 
(F2Bpcr) [Figure 9] 


46 


Amino acid sequence of H. punctigera chymotrypsin 
(F3pcr) [Figure 9] 


47 


Amino acid sequence of H. punctigera chymotrypsin 
CF4pcr) [Figure 91 



WO 2004/094630 



PCT/AU2004/000524 



-13- 





. TYFSmrPTTON 


48 


Amino acid sequence of chymotrypsin from K 
punctigera (HpChlM) [Figure 10] 


49 


Amino acid sequence of chymotrypsin from H, 
punctigera (HpChlBI) [Figure 10] 


50 


Amino acid sequence of chymotrypsin from H. 
punctigera (HpCh2A) [Figure 10] 


51 


Amino acid sequence of chymotrypsin from K 
punctigera (HpCh2B) [Figure 10] 


52 


Amino acid sequence of chymotrypsin from H. 
punctigera (HpCh3A) [Figure 10] 


53 


Amino acid sequence of chymotrypsin from H. 
punctigera (HpCh4I) [Figure 10] 


54 


Amino acid sequence of chymotrypsin from H. 
punctigera (HpCh4II) [Figure 10] 


55 


Amino acid sequence of peptide from K punctigera 
chymotrypsin (Rechla) [Figure 11 A] 


56 


Amino acid sequence of peptide from K punctigera 
chymotrypsin (Rechlb) [Figure 11 A] 


57 


Amino acid sequence of peptide from H. punctigera 
chymotrypsin (Family la) [Figure 1 1A] 


58 


Amino acid sequence of peptide from H. punctigera 
chymotrypsin (Familylb) [Figure 11 A] 


59 


Amino acid sequence of peptide from H. punctigera 
chymotrypsin (Family2b) [Figure 1 1 A] 


60 


Amino acid sequence of peptide from K punctigera 
chymotrypsin (Family2a) [Figure 11 A] 


61 


Amino acid sequence of peptide from K punctigera 
chymotrypsin (Family3) [Figure 1 1 A] 


62 


Amino acid sequence of peptide from H. punctigera 
chymotrypsin (Family4) [Figure 1 1 A] 


63 


Amino acid sequence of H. punctigera chymotrypsin 
(HpChlAI) [Figure 13] 


64 


Amino acid sequence of K punctigera chymotrypsin 
(HpChlBI) [Figure 13] 


65 


Amino acid sequence of H. punctigera chymotrypsin 
(HpCh2B) [Figure 13] 


66 


Amino acid sequence of H. punctigera chymotrypsin 
(HpCh2A) [Figure 13] 


67 


Amino acid sequence of K punctigera chymotrypsin 
(HpCh3) [Figure 13] 


68 


Amino acid sequence of H. punctigera chymotrypsin 
(HpCh4I) [Figure 13] 


69 


Amino acid sequence of K punctigera chymotrypsin 
(HpCh5) [Figure 131 
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70 


Amino acid sequence of human brain trypsin 
(TrypsinlV) [Figure 13] 


71 


Amino acid sequence of chymotrypsin from H. 
armigera [Figure 14] 


72 


Amino acid sequence of chymotrypsin from H. 
punctigera [Figure 14] 


73 


Amino acid sequence of bovine chymotrypsin B (BOV 
CHB) [Figure 15] 


74 


Amino acid sequence of bovine chymotrypsin A (BOV 
CHA) [Figure 15] 


75 


Amino acid sequence from K punctigera (HpCh2A) 
[Figure 15] 


76 


Amino acid sequence from K punctigera (HpCh5) 
[Figure 15] 


77 


Amino acid sequence of potato inhibitor I family (PotI) 
{StPotIB} [Figure 24] 


78 


Amino acid sequence of potato inhibitor I family (PotI) 
{X67950} [Figure 24] 


79 


Amino acid sequence of potato inhibitor I family (PotI) 
{R01052} [Figure 24] 


80 


Amino acid sequence of potato inhibitor I family (PotI) 
{M17108} [Figure 24] 


81 


Amino acid sequence of potato inhibitor I family (PotI) 
{StPotIA} [Figure 24] 


82 


Amino acid sequence of potato inhibitor I family (PotI) 
{K03290} [Figure 24] 


83 


Amino acid sequence of potato inhibitor I family (PotI) 
{Z12619} [Figure 24] 


84 


Amino acid sequence of potato inhibitor I family (PotI) 
{X78988} [Figure 24] 


85 


Amino acid sequence of potato inhibitor I family (PotI) 
{EILXCH} [Figure 24] 


86 


Nucleotide sequence encoding endoplasmic reticulum 
peptide [Figure 28] 


87 


Amino acid sequence of endoplasmic reticulum peptide 
[Figure 28] 


88 


Nucleotide sequence of FwBacRECHl primer [Figure 

28] 


89 


Nucleotide sequence of FwBacRECH2 primer [Figure 

28] 


90 


Nucleotide sequence of HpF5 to which DNA encoding 
endoplasmic reticulum signal is to be added [Figure 28] 


91 


Amino acid sequence of HpCh5 to which endoplasmic 
reticulum signal is to be added [Figure 28] 
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92 


Nucleotide sequence of RvRECH primer [Figure 28] 


93 


Amino acid sequence of HpCHYl [Figure 4C] 
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BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a graphical representation showing temperature stability of baculovirus 
expressed H. punctigera chymotrypsin. Bovine chymotrypsin (♦) and HpF5 (A) in 50 
5 mM Na acetate were compared by incubating 100 ^iL aliquots at 5°C intervals between 
40°C and 70°C. After chilling the heated samples on ice, duplicate activity assays were 
performed by incubating 10 of be or 50 |iL of HpF5 with substrate at 30°C. Residual 
activity was presented as a percentage of the activity of the untreated control. 

10 Figure 2 is a graphical representation showing growth of H. punctigera larvae fed on 
either low protein haricot bean artificial diet or cotton leaf artificial diet, in the presence or 
absence of 0.26% (w/v) NaPIs. Weight gain (mg) was monitored for 21 days. Larvae grew 
at a similar rate on both artificial diets and growth was retarded in the presence of NaPIs. 
(A) Growth of larvae. (B) Relative size of larvae at 21 days after feeding on cotton 

1 5 artificial diet, in the presence or absence of 0.26% NaPIs. Only five of the 20 larvae fed on 
NaPI survived and all five weighed less than control larvae after 21 days. (Q The effect of 
NaPIs on gut trypsin and chymotrypsin activity. When each H. punctigera larva reached 
the late fourth/ early fifth instar stage of development, the gut was removed and gut extract 
prepared prior to assays for trypsin and chymotrypsin activity. Extracts from individual 

20 larvae fed on control cotton-leaf artificial diet (CC) are indicated in grey and larvae fed on 
the same diet containing 0.26% ((w/v)) NaPI (NC) are indicated in black. Only four test 
larvae survived to fifth instar, whereas most control larvae survived. All assays were 
performed in duplicate. Trypsin activity was determined using BApNA substrate and 
chymotrypsin activity with SAAPFpNA substrate. Units of activity are expressed as 

25 change in absorbance at 405 nm/min/mg gut extract protein (± standard deviation). Trypsin 
activity was almost abolished, relative to controls, in extracts from NaPI-fed larvae, while 
chymotrypsin activity was low in two of the four NaPI-fed larvae. (D) The effect of NaPIs 
on trypsin and chymotrypsin activity in the frass. Frass was collected from each larva fed 
on the cotton leaf artificial diet and faecal extracts were prepared. Trypsin and 

30 chymotrypsin activity in extracts from larvae fed on the control diet (CC) are indicated in 
grey and larvae fed on the same diet containing 0.26% ((w/v)) NaPI (NC) are indicated in 
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black. Note that frass of larvae CC16 and NCI 3 were included in this analysis but 
excluded from Figure 2C because the gut were damaged during preparation of extracts. All 
assays were performed in duplicate. Trypsin activity was determined using BApNA 
substrate and chymotrypsin activity with SAAPFpNA substrate. Units of activity are 

5 expressed as change in absorbance at 405 nm/min/mg faecal extract protein (± standard 
deviation). Negligible trypsin activity was evident in extracts from NaPI-fed larvae, while 
chymotrypsin activity was elevated, relative to controls. (E) Levels of trypsin in the frass 
of control and NaPI fed larvae. Frass extracts were analysed on 15% ((w/v)) SDS-PAGE 
and transferred to nitrocellulose. The nitrocellulose filter was probed with rabbit anti- 

10 flpTRYl serum (1 in 2000 dilution) as the primary antibody, followed by the secondary 
antibody, donkey anti-rabbit IgG-horse radish peroxidase conjugate (1 in 2000 dilution). 
Immuno-reactive proteins were visualized using Enhanced Chemiluminescence (ECL) 
reagents and HyperfilmECL X-ray film. The trypsin in the frass of larvae that consumed 
NaPIs (NC) was significantly increased relative to controls (upper panel) but inactive 

15 (l° wer panel), presumably because it was in complex with the NaPIs. In comparison, 
trypsin was active in the frass of the control larvae (CC) even though the amount present 
was below the detection level of the antibody. (F) Production of NaPI-insensitive proteases 
in the gut of NaPI fed larvae. The gut extracts of the larvae were subjected to inhibition 
assays to identity NaPI-insensitive trypsins and chymotrypsins. Each extract was pre- 

20 incubated for 30 min at 30°C in the presence or absence of 80 nM NaPI inhibitor (Tl 
monomer for trypsin assays and CI monomer for chymotrypsin assays), prior to the 
addition of substrate to initiate the reaction. Results from extracts assayed without inhibitor 
are indicated in grey(controi larvae) and black (NaPI-fed larvae). Results from extracts 
assayed with inhibitor are stippled. All assays were performed in duplicate. Units of 

25 activity are expressed as substrate hydrolysis at 405 nm/min/mg extract protein (± standard 
deviation). (A) Inhibition of trypsin activity by Tl. Trypsin activity was determined using 
BApNA substrate. (B) Inhibition of chymotrypsin activity by CI. Chymotrypsin activity 
was determined with SAAPFpNA substrate. Tl almost totally inhibited trypsin activity in 
the extracts of control larvae, indicating these larvae did not produce NaPI-insensitive 

30 trypsins. The extracts from NaPI-fed larvae contained negligible trypsin activity, but this 
activity could not be inhibited by Tl. CI did not inhibit chymotrypsin activity in the 
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extracts of six control larvae and only partially inhibited the gut chymotrypsins in the rest 
of the controls indicating the control insects contained a CI -insensitive chymotrypsin. 
Most of the chymotrypsin activity in extracts from NaPI-fed larvae can be attributed to 
NaPI-insensitive chymotrypsins. When extracts from control and NaPI-fed larvae were 
5 pre-incubated with 80 nM chymostatin, all chymotrypsin activity was abolished. 

Figure 3 is a graphical representation showing the effect of various proteinase inhibitors 
on the chymotrypsin activity in unfractionated gut extracts from H. punctigera. Proteinase 
inhibitors were mixed with 1 \xg of protein from an unfractionated gut extract before 
10 incubation with the chymotrypsin substrate N-succinyl-Ala-Ala-Pro-Phe-p-nitroanilide. 
Inhibition is expressed as a percentage of the total activity in the control samples. 

Figure 4 is a photographic representation showing purification and N-terminal sequence 
of an NaPI inhibitable chymotrypsin from H. punctigera gut. (A and B) Protein gel 
15 analysis of fractions at various stages of purification using an affinity column with 
immobilized CI inhibitor (Figure 1). (A) 15% ((w/v)) SDS-polyacrylamide gel loaded 
with (a) unfractionated gut extract (b) and (c) unbound proteins. (B) 12.5% (w/v) SDS- 
polyacrylamide gel loaded with (d) wash fraction (e) protein bound to the CI column. (C) 
N-terminal sequence of the ~24 kDa protein in lane (e). 

20 

Figure 5 is a diagrammatic representation showing purification and N-terminal sequence 
of an NaPI-insensitive chymotrypsin from K punctigera. (A) PVDF blot of chymotrypsin 
(i) eluted from potato inhibitor column. Potato Inhibitor II (ii) and potato inhibitor I (iii) 
both co-eluted from the matrix under denaturing conditions. (B) N-terminal amino acid 
25 sequence obtained from PVDF blot. Rechla was the most abundant of the four sequences 
obtained. 

Figure 6 is a graphical representation showing the effect of pH and a range of substrates 
on the activity of the NaPI-insensitive chymotrypsin from H. punctigera midgut. SA 2 PF- 
30 pNA, N-succinyl-Ala-Ala-Pro-Phe-/?-nitroanilide; SA 2 PL-pNA, N-succinyl-Ala-Ala-Pro- 
Leu-/Miitroanilide; MA 2 PM-pNA, N-methoxysuccinyl-Ala-Ala-Pro-Met-p-nitroanilide. 
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Figure 7 is a diagrammatic representation showing the design of oligonucleotide primers 
for amplification of chymotrypsins from H. punctigera. An alignment of chymotrypsins 
from Helicoverpa armigera predicted from DNA sequences in the GenBank database. 
5 NCBI protein database accession numbers are left of the sequences. Regions 
corresponding to the oligonucleotide primers are boxed and the direction of amplification 
are indicated by arrows. 

Figure 8 is a diagrammatic representation showing oligonucleotide sequences used in 
1 0 RT-PCR amplification of Helicoverpa chymotrypsins. 

Figure 9 is a diagrammatic representation showing PCR products from amplification of 
cDNAs encoding H. punctigera chymotrypsins. PCR amplification of cDNA prepared 

from gut mRNA yielded partial sequence for five distinct chymotrypsins. The translated 

** 

1 5 sequence is aligned and the region corresponding to the PCR primers is boxed. 

Figure 10 is a diagrammatic representation showing alignment of predicted amino acid 
sequence of chymotrypsins from H. punctigera. The catalytic residues are marked by a 
solid triangle (T). The highly conserved active site motifs are highlighted with grey. The 
20 dipeptide R-I (U), conserved among all chymotrypsins is the site for the proposed 
cleavage of the activation peptide by trypsin. The residues that lie in the substrate binding 
pocket and confer substrate specificity are indicated by the symbols ¥,§,#. The cysteine (•) 
residues are highly conserved among all chymotrypsins. 

25 Figure 11 is a diagrammatic representation showing design of oligonucleotide primers for 
amplification of cDNA encoding the NaPI-insensitive chymotrypsins from K punctigera. 
(A) Comparison of the N-terminal sequence of two NaPI-insensitive chymotrypsins with 
Helicoverpa chymotrypsins predicted from the cDNA clones. The unique regions Fl and 
F2 are shaded. (B) Oligonucleotide primers complementary to unique regions at the N- 

30 terminus of the insensitive chymotrypsin. 
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Figure 12 is a diagrammatic representation showing nucleotide sequence and deduced 
amino acid sequence from the cDNA encoding the insensitive chymotrypsin. The 
nucleotide sequence of the insensitive chymotrypsin cDNA and deduced amino acid 
sequence. The amino acid sequence obtained from N-terminal sequence of purified protein 
5 is shaded in grey. The putative site for endoproteolytic cleavage by trypsin is shown by the 
arrow. The double underlined regions in the nucleotide sequence refer to the positions of 
the degenerate primers used for PCR amplification. The polyadenylation signal sequence is 
single underlined and an asterisk marks the stop codon. The deduced amino acid sequence 
of the putative activation peptide is numbered -40 to -1 followed by the mature domain 
10 (+1). The three amino acids that correspond to the catalytic residues are marked by the 
symbol #. The chymotrypsin substrate specificity residue, serine, located at the base of the 
primary substrate-binding pocket is marked with the symbol §. 

Figure 13 is a representation showing alignment of H. punctigera chymotrypsin families 
15 showing sequence identity. ClustalW alignment of members from the five families of H. 
punctigera chymotrypsins. Protein sequence is given in single letter code. Identical amino 
acids are coloured black, similar amino acids are grey. Amino acids are numbered on the 
right and gaps have been introduced to maximize the alignment. The NaPI- insensitive 
chymotrypsins are members of family 5 and are characterized by a unique arginine residue 
20 (arrowed) at position 185. Human trypsin IV also contains an arginine residue (arrowed) in 
a similar position. 

Figure 14 is a representation showing alignment of the H. punctigera NaPI-insensitive 
chymotrypsin with a homolog from K armigera that also has an arginine residue at 
25 position 185. 

Figure 15 is a representation showing the deduced protein sequences for the insensitive 
(HpCh5) and sensitive (HpCh2A) chymotrypsins from H. punctigera aligned to the bovine 
chymotrypsin isoforms A and B. H. punctigera chymotrypsins HpCh2A and HpChS were 
30 aligned to the bovine chymotrypsin isoforms A and B using ClustalW. The numbering 
system (excluding gaps) is according to the nomenclature of Greer, Proteins 7: 317-34, 
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1990 used for bovine chymotrypsin. Dots throughout the sequences represent conserved 
residues. The regions shaded in grey designate residues that form surface loops that are 
involved in recognition and binding of substrates or inhibitors. The primary substrate- 
binding pocket is formed by the regions labeled S1A, SIB and SIC. The SV site is formed 

5 by loops 35 and 60. Black boxes mark residues in the HpF5 sequence that differ 
significantly to amino acids in the corresponding positions in other chymotrypsins.Using 
the Greer, 1990, supra nomenclature these residues are Asp36, Arg 63, Thr72, Pro 83, Gly 
109, lieu 120, Glu insertion between 129 and 130, Glu 134, Serl45, Arg 192 and Pro 207. 
The boxed amino acids are removed from bovine chymotrypsins by autocatalytic cleavage 

1 0 that results in the formation of a-chymotrypsin. 

Figure 16 is a diagrammatic representation showing several surface loops in the structural 
model of if. punctigera chymotrypsin HpCh2A are larger than the cognate loops in bovine 
chymotrypsin. The structural model of H. punctigera F2A chymotrypsin (grey) was 
15 superimposed onto the structure of alpha-chymotrypsin from Bos taurus (black). Surface 
loops 60, 35, and 142 that are implicated in substrate recognition are larger in the insect 
chymotrypsin model. The chymotrypsin substrate specificity residue, serine 189, 
positioned at the base of the primary substrate-binding pocket is viewed as a space filled 
representation of the van der Waals radius. 

20 

Figure 17 is a representation showing glutamine 192 (Greer, 1990, supra nomenclature, 
Figure 15) of the sensitive chymotrypsin HpCh2A appears easily accommodated when 
modeled in complex with CI. Structural model of the sensitive chymotrypsin HpCh2A 
(grey) in complex with the proteinase inhibitor CI (black). The side chain of glutamine 
25 192 is arrowed The residues of CI in the vicinity of Gin 192 are represented in stick 
configuration (black). 

Figure 18 is a diagrammatic representation showing comparison of the environment 
surrounding residue 192 of the sensitive and insensitive chymotrypsin complexed to CL 
30 Enlarged view of the boxed area shown in Figure 17. Arginine 192 of the insensitive 
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chymotrypsin (B) appears to clash with residues of the CI inhibitor, in contrast there is no 
apparent conflict with glutamine 192 of the sensitive chymotrypsin (A). 

Figure 19 is a diagrammatic representation showing the environment surrounding arginine 
5 192 when the insensitive chymotrypsin is complexed to the Type 1 potato proteinase 
inhibitor (PotIB, Fig24). (A) Structural model of the insensitive chymotrypsin HpCh5 
(grey) in complex with the potato type I proteinase inhibitor PotI (black). (B) Enlarged 
view of the region around Arg 192 (boxed area in B). The side chain of arginine 192 is 
labeled. The residues of PotI in the vicinity of Arg 192 are represented in stick 
1 0 configuration (black). 

Figure 20 is a photographic representation showing expression of the chymotrypsin clone 
HpF2B in E. coli cells for the production of a polyclonal antibody. (A) The separation of 
total cell lysates taken at time points 0-5 hr after induction of HpCh2B on a 12.5% (w/v) 

15 SDS-PAGE gel stained with Coomassie Blue. The lanes are marked by the number of 
hours after induction and the arrow indicates the position of the induced protein with the 
correct predicted molecular mass. (B) Panel 1: Bacterially expressed HpCh2B purified on 
Talon resin (BD Biosciences Clontech), separated on a 15% (w/v) SDS-PAGE gel and 
stained with Coomassie Blue, Panel 2: Identical sample to Panel 1 transferred to 

20 nitrocellulose and immunostained with anti-HpCh2B antibodies. (C) Decreasing amounts 
(200, 150, 100, 75, 50, 25, 20, 10, 0 ng) of bacterially expressed chymotrypsin HpCh2B 
separated by SDS-PAGE and stained with silver and a protein blot of an identical gel 
probed with anti-chymotrypsin HpCh2B antibodies (1/2500). The H. punctigera antibody 
had a detection limit of 20 ng of bacterially expressed protein. 

25 

Figure 21 is a photographic representation showing specificity of antibodies raised against 
bacterially expressed NaPI-insensitive (HpChS) and sensitive (HpCh2B) chymotrypsins 
from H. punctigera. Bacterially expressed NaPI-insensitive (R) and sensitive (C) 
chymotrypsins were separated by SDS-PAGE on 12.5% (w/v) polyacrylamide gels and (A) 
30 stained with Coomassie Blue, (B) immunoblotted with an a-His tag antibody, (Q 
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immunoblotted with the antibody to the HpCh2B chymotrypsin (a-RC), and (D) 
immunoblotted with the antibody to the HpChS chymotrypsin (a-SC). 

Figure 22 are representations showing purification of PotI from potato tubers. PotI was 
5 purified from potato tubers (Russet Burbank) by acid extraction, ammonium sulphate 
precipitation and gel filtration. (A) SDS-PAGE stained with silver, lane 1: molecular size 
markers (kDa), lane 2: pooled PotI containing fractions from G-75 column, lane 3: 
immunoblot of lane 2 using an antibody raised in rabbits to a commercial preparation of 
PotI (Calbiochem) linked to keyhole limpet hemocyanin. PotI was identified as a single 
10 band with an approximate mass of 6 kDa. (B) RP-HPLC of pooled G-75 fractions from A. 
Peaks 1, 2 and 3 are PotI isoforms, peak 4 is a contaminating protein. 

Figure 23 is a graphical representation showing growth of H. armigera larvae on artificial 
diet containing NaPI and PotI. Growth of H. armigera, larvae fed on a cotton leaf artificial 

15 diet in the presence or absence of 0.26% (w/v) NaPI or 0.26% (w/v) NaPI plus 0.26% 
(w/v) PotI. The PotI was purified from potato tubers (var Russet Burbank), see Figure 22. 
Twenty five larvae were used on each diet. The weight of the larvae was measured at days 
7, 10, 12, 14 and 17 post egg hatch. At day 17, larvae fed NaPI alone were 84% of the 
control and larvae fed NaPI and PotI were 34% of the control. Two larvae fed on the 

20 control diet died, seven larvae fed the NaPI diet died and six larvae fed on the NaPI plus 
PotI diet died. 

Figure 24 is a representation showing the alignment of predicted amino acid sequence of 
StPotIA and StPotEB with members of the potato Inhibitor I family. ClustalW alignment of 

25 several members of the Potato Inhibitor I family. X67950: potato cDNA, (Beuning and 
Christeller, Plant Physiol 102: 1061, 1993), P01052: potato protein (Richardson and 
Cossins, FEBS Letters, 52: 161, 1975), M17108: potato genomic sequence (Cleveland et 
al, Plant Mol. Biol 8: 199-207, 1987), K03290: tomato (Graham et al, J. Biol Chem., 
260: 6555-6560, 1985) Z12619: tobacco (Lindhorst et al, Plant Mol Biol 21: 985-992, 

30 1993), X78988: maize (Jose Cordero et al, Plant J. 6: 141-150, 1994), EILXCH: leech 
(See Muller et al, Hoppe-Seyler's Z. Physiol Chem. 358: 1105-1117, 1977). * PI reactive 
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site. Both StPotIA and StPotIB are similar to other family members from potato. However, 
StPotIA has an additional four amino acids at position 41 to 44 that are also found in a 
wound induced PotI from tomato (K03290). StPotIB has a methionine at the PI site which 
is common for potato isolates. StPotIA has an alanine at the PI site which has not been 
5 reported for PotI isolates from potato, but is present in a PotI isolate from maize (X78988). 

Figure 25 is a graphical representation showing purification of E. coli expressed StPotIA 
and StPotIB. StPotIA and StPotIB were purified using the N-terminal HIS tag fused to 
StPotIA and StPotIB and a metal affinity resin followed by RP-HPLC. Profile after 

10 separation by RP-HPLC. Protein was eluted with a linear gradient of 0-100% Buffer B 
(80% (v/v) acetonitrile, 0.1% (v/v) TFA) at a flow-rate of 1 ml/min over 60 min. (A) 
StPotIA, (B) StPotIB. StPotIA eluted in one major peak at 36 min retention time (60% 
Buffer B) and StPotIB eluted in one major peak at 25 min retention time (42% Buffer B). 
The peaks were analyzed by SDS-PAGE and stained with silver (insert in A). Lane 1 is 

15 StPotIA (pkl), lane 2 is StPotIB (pk2) and lane 3 is purified PotI from potato tubers. 
StPotIA and StPotIB have a different apparent mobility (10 kDa) to a mix of PotI isoforms 
isolated from tubers (6 kDa), due to the additional HIS-tag epitope at the N terminus. 

Figure 26 is a graphical representation showing inhibition of NaPI-insensitive 
20 chymotrypsin by bacterially expressed StPotIA and StPotIB. Inhibition of the NaPI- 
insensitive chymotrypsin from the gut of R punctigera with purified PotI. (A) substrate 
SA 2 PFpNA, development time 30 min, (B) substrate SA 2 PLpNA, 30 min incubation. Mix 
of PotI isoforms from potato tuber (circle), StPotIA (square), StPotIB (triangle). StPotIA, 
StPotIB and the PotI from potato tubers were good inhibitors of the NaPI- insensitive 
25 chymotrypsin. 

Figure 27 is a graphical representation showing growth of K armigera larvae on 
transgenic cotton expressing NaPI and PotI. Transgenic cotton cv Coker 315 was used in 
bioassays with H. armigem. Thirty larvae were fed leaves of either the control 
30 untransformed Coker 315, transgenic line 1 (NaPI), transgenic line 2 (StPotIA) or 
transgenic plant 3 (NaPI X StPotIA). The weight of the larvae was measured at day 7 post- 
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egg hatch. (A) Growth of larvae. At day 7, larvae fed leaves expressing NaPI were 86% of 
the weight of the control larvae fed untransformed leaves. Larvae fed leaves expressing 
StPotIA were 92% of the control and larvae fed leaves expressing both NaPI and StPotIA 
were 46% of the control, (B) The effect of ingestion of NaPI and PotI on gut trypsin and 

5 chymotrypsin activity. Gut from the larvae in each experiment were pooled and extracts 
prepared. All assays were performed in duplicate. Trypsin activity (black) was determined 
using BApNA substrate and chymotrypsin activity (grey) with SA 2 PFpNA substrate. Units 
of activity are expressed as change in absorbance at 405 nm/min/ug gut extract protein. 
Trypsin activity was reduced, relative to the control, in the extracts from larvae fed leaves 

10 expressing NaPI and NaPI+StPotlA. Chymotrypsin activity was elevated in extracts from 
larvae fed leaves expressing NaPI or NaPI + StPotIA and reduced in extracts from larvae 
fed StPotIA alone. 

Figure 28 diagrammatic representation showing the nucleotide sequence and deduced 
15 amino acid sequence from the HpF5 cDNA encoding the NaPI-insensitive chymotrypsin 
and the location of the oligonucleotide primers used to add an endoplasmic reticulum 
sequence to HpCh5. The FwBacRECHl primer was used to add the first half of the ER 
signal sequence as well as a silent mutation, changing A to G to destroy the BamHl cut 
site. The FwBacRECH2 primer added the remainder of the coding sequence for the ER 
20 signal as well as a BamHi cut site to the 3' end of the sequence. The ER signal was added 
before the hexahistidine tag to enable purification of the expressed protein by metal 
affinity chromatography after cellular processing. The added amino acids are shaded in 
grey. 

25 Figure 29 is a photographic representation showing expression of the chymotrypsin clone 
HpF5 in baculovirus infected insect cells. Expressed proteins were separated on 12.5% 
(w/v) SDS-PAGE gels and subjected to immunoblots with the <x-HpCh5-antibody (A) 
RCDNA. Production of HpChS by HIGH FIVE (trademark) insect cells transfected with 
20 yl of bacmid DNA. Controls; (pFastBacVector) insect cells transfected with bacmid 

30 DNA transposed with pFastBac vector without the HpF5insert; (blue colony) insect cells 
transfected with untransposed bacmid DNA; (cells alone) untransfected insect cells; 
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(Cellfectin) insect cells treated with CELFECTIN (registered trademark) alone; The 
positive control, + is bacterially expressed HpCh5 with a hexahistidine tag. (B) Production 
of HpCh5 by HIGH FIVE (trademark) insect cells infected with virus. RC 24, 48 and 72. 
Medium collected 24, 48 and 72 hours after infection with HpF5 recombinant virus. 
5 Controls 24, 48 and 72. Medium from cells treated with virus prepared from 
nontransfected bachmid (blue colony) for 24, 48 and 72 hours. Positive control is 
bacterially expressed HpCh5 with a hexahistidine tag. 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention is predicated in part on the identification and cloning of a novel 
insect chymotrypsin molecule termed "HpChS". cDNA encoding HpChS is referred to 
5 herein as "HpF5". The isolation of this molecule permits the identification and design of a 
range of products which are useful in controlling the growth, development and/or overall 
biological fitness of Helicoverpa spp. and other insects. These products generally act as 
antagonists of HpCh5 function or HpF5 gene expression and are useful as insect control 
agents. 

10 

The amino acid sequence of HpChS is set forth in SEQ ID NO:2. The nucleotide sequence 
of HpF5 is set forth in SEQ ID NOs:4 and 6. 

Reference herein to "HpCh5" should be understood as a reference to all forms of HpChS 
15 including, for example, any peptide isoforms which arise from alternative splicing of HpF5 
mRNA, mutants or polymorphic variants of HpCh5, any post-translation modified forms of 
HpChS or any non-post-translational modified forms of HpChS as well as any homolog in 
other insect species or strains. The term "HpChS" also encompasses members in a HpChS 
family of chymotrypsin molecules. To the extent that it is not specified, reference herein to 
20 HpChS includes derivatives, homologs, analogs, chemical equivalents and mimetics 
thereof. Reference to HpChS also refers to any variant having at least 75% amino acid 
identity to SEQ ID NO:2 after optimal alignment. Examples of variants of HpChS include 
Pi-sensitive variants such as those inter alia having an Arg 192 Gin or Arg 192 Asn 
substitution. Other variants include the N-terminal signal sequence of HpChS as defined in 
25 SEQ ID NO:3 and which is encoded by the nucleotide sequence set forth in SEQ ID NO:5. 
Such variants include a signal sequence comprising an amino acid sequence having at least 
about 75% similarity to SEQ ID NO:3 after optimal alignment or encoded by a nucleotide 
sequence having at least about 75% identity to SEQ ID NO:5 or a nucleotide sequence 
capable of hybridizing to SEQ ID NO:5 after optimal alignment. 

30 

Reference to "HpF5" should be understood as reference to all forms of HpF5 including any 
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cDNA isoform, genomic forms, mutants and polymorphic variants of HpF5 as well as any 
homologs from other species or strains of insect. The term "HpF5" also encompasses 
members of a HpF5 family of genes which encode HpChS or HpCh5-type chymotrypsins. 
To the extent that it is not specified, reference herein to HpF5 includes derivatives of HpF5 
5 as well as a nucleotide sequence having at least about 75% identity to SEQ ID NO:4 or 
SEQ ED NO:6 or a nucleotide sequence capable of hybridizing to SEQ ID NO:4 or SEQ ID 
NO:6 or its complementary form under low stringency conditions. The signal sequence of 
HpF5 as defined in SEQ ID NO:3 and encoded by SEQ ID NO:5 also encompasses 
variants thereof. As indicated above, such variants include a signal sequence having at 
10 least about 75% similarity to SEQ ID NO:3 after optimal alignment and/or being encoded 
by a nucleotide sequence capable of hybridizing to SEQ ID NO: 5 under low stringency 
conditions and/or a nucleotide sequence having at least about 75% identity to SEQ ID 
NO: 5 after optimal alignment. 

15 Before describing the present invention detail, it is to be understood that unless otherwise 
indicated, the subject invention is not limited to specific formulations of components, 
manufacturing methods, administration regimens, or the like, as such may vary. It is also to 
be understood that the terminology used herein is for the purpose of describing particular 
embodiments only and is not intended to be limiting. 

20 

It must be noted that, as used in the subject specification, the singular forms "a", "an" and 
"the" include plural aspects unless the context clearly dictates otherwise. Thus, for 
example, reference to an "agent" or "antagonist" includes a single agent or antagonist as 
well as two or more agents or antagonists and so forth. 

25 

In describing and claiming the present invention, the following terminology is used in 
accordance with the definitions set forth below. 

The terms "compound", "agent", "active agent" and "active" are used interchangeably 
30 herein to refer to a chemical compound which inhibits the activity of HpCh5 or the 
expression of a genomic gene corresponding to HpF5. The terms also encompass 
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agriculturally or horticultural active ingredients of those active agents specifically 
mentioned herein including but not limited to salts, esters, amides, prodrugs, active 
metabolites, analogs and the like. When the term "compound", "agent", "active agent" or 
"active" is used, then it is to be understood that this includes the agent per se as well as 
5 agriculturally or horticultural^ acceptable, physiologically active salts, esters, amides, 
prodrugs, metabolites, analogs, etc. The term "compound" is not to be construed as a 
chemical compound only but extends to peptides, polypeptides and proteins as well as 
genetic molecules such as RNA, DNA and chemical analogs thereof. 

10 The present invention contemplates, therefore, compounds useful in down-regulating the 
activity of HpCh5 or down-regulating expression of a genomic gene corresponding to 
HpCh5. The term "down-regulating" encompasses inhibition of HpCh5 activity. The 
inhibition of HpChS or reduction in its levels has the effect of reducing or retarding the 
growth of the insect. The inhibition of HpCh5 activity or HpF5 gene expression may occur 
15 by producing an inhibitor in a plant which is then consumed by the insect or the inhibitor 
may be topically applied to a plant or sprayed or otherwise dispersed to insects or a source 
of insects. In this regard, the plant may produce a nucleic acid molecule that interferes with 
HpF5 expression when consumed by the insect. Alternatively, the plant may produce a PI 
capable of inhibiting HpChS. Still in a further alternative, the HpCh5 inhibitor is a non- 
20 proteinaceous chemical applied to the surface of a plant or taken up by the root system of a 
plant. Reference herein to a "plant" is not to exclude trees or cultured tissues (e.g. callus) 
from a plant (or tree). 

Reference to compounds, agents and actives also includes combinations of compounds, 
25 agents or actives. Such combinations may be formulated in multi-part agricultural or 
horticultural compositions which are admixed together prior to dispersement or given 
sequentially. 

The terms "effective amount", "insecticidal effective amount" and "insect-static effective 
30 amount" of an agent as used herein mean a sufficient amount of the agent to reduce or 
retard insect growth and development and/or to kill or inhibit the insect. 
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By "agriculturally acceptable" or "horticultural^ acceptable" carrier, excipient or diluent 
is meant a vehicle comprised of a material that is not environmentally or otherwise 
undesirable to a plant or non-target insect. Carriers may include excipients and other 
5 additives such as diluents, detergents, coloring agents, wetting or emulsifying agents, pH 
buffering agents, preservatives, and the like. 

The terms "treating" and "treatment" as used herein in relation to plants or eradication of 
insects refer to reduction in severity of symptoms of insect infestation of a plant or the 
10 application of the agents to a group of insects resulting in retardation of their growth, 
development or biological fitness or wellbeing. 

The compounds of the present invention may be large or small molecules, nucleic acid 
molecules (including antisense or sense molecules), peptides, polypeptides or proteins or 
1 5 hybrid molecules such as RNAi- or siRNA-complexes, ribozymes or DNAzymes. 

The term "nucleic acid molecule" is also encompassed by the expression "genetic 
molecule" and includes hairpin constructs such as those which include RNAi-mediated 
post-transcriptional gene silencing or methylation-mediated silencing. 

20 

Accordingly, one aspect of the present invention provides an isolated nucleic acid 
molecule or derivative, homolog or analog thereof comprising a nucleotide sequence 
encoding a novel chymotrypsin protein or a derivative, homolog or mimetic thereof 
wherein said chymotrypsin is insensitive to a PI of N. alata. 

25 

More particularly, the present invention is directed to a nucleic acid molecule or derivative, 
homolog or analog thereof comprising a nucleotide sequence encoding, or a nucleotide 
sequence complementary to a nucleotide sequence encoding, an amino acid sequence 
substantially as set forth in SEQ ID NO:2 or a derivative, homolog or mimetic thereof or 
30 having at least about 75% or greater identity to SEQ ID NO:2 after optimal alignment or a 
nucleotide sequence set forth in SEQ ID NO:4 or SEQ ID NO:6 or a nucleotide sequence 
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having at least about 75% similarity or greater to SEQ ID NO:4 or SEQ ID NO:6 or a 
nucleotide sequence capable of hybridizing to SEQ ID NO:4 or SEQ ID NO:6 or its 
complementary form under low stringency conditions. 

5 Another aspect of the present invention provides an isolated chymotrypsin Helicoverpa 
ssp. wherein said chymotrypsin exhibits resistance to a PI from N. alata or a variant, 
derivative, homolog or analog of said chymotrypsin. 

More particularly, the isolated chymotrypsin comprises an amino acid sequence set forth in 
10 SEQ ED NO:2 or an amino acid sequence having at least about 75% similarity to SEQ ID 
NO:2 after optimal alignment. 

The term "similarity" as used herein includes exact identity between compared sequences 
at the nucleotide or amino acid level. Where there is non-identity at the nucleotide level, 

15 "similarity" includes differences between sequences which result in different amino acids 
that are nevertheless related to each other at the structural, functional, biochemical and/or 
conformational levels. Where there is non-identity at the amino acid level, "similarity" 
includes amino acids that are nevertheless related to each other at the structural, functional, 
biochemical and/or conformational levels. In a particularly preferred embodiment, 

20 nucleotide and amino acid sequence comparisons are made at the level of identity rather 
than similarity. 

Terms used to describe sequence relationships between two or more polynucleotides or 
polypeptides include "reference sequence", "comparison window", "sequence similarity", 

25 "sequence identity", "percentage of sequence similarity", "percentage of sequence 
identity", "substantially similar" and "substantial identity". A "reference sequence" is at 
least 12 but frequently 15 to 18 and often at least 25 or above, such as 30 monomer units, 
inclusive of nucleotides and amino acid residues, in length. Because two polynucleotides 
may each comprise (1) a sequence (i.e. only a portion of the complete polynucleotide 

30 sequence) that is similar between the two polynucleotides, and (2) a sequence that is 
divergent between the two polynucleotides, sequence comparisons between two (or more) 
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polynucleotides are typically performed by comparing sequences of the two 
polynucleotides over a "comparison window 3 ' to identify and compare local regions of 
sequence similarity. A "comparison window" refers to a conceptual segment of typically 
12 contiguous residues that is compared to a reference sequence. The comparison window 

5 may comprise additions or deletions (i.e. gaps) of about 20% or less as compared to the 
reference sequence (which does not comprise additions or deletions) for optimal alignment 
of the two sequences. Optimal alignment of sequences for aligning a comparison window 
may be conducted by computerized implementations of algorithms (GAP, BESTFIT, 
FASTA, and TFASTA in the Wisconsin Genetics Software Package Release 7.0, Genetics 

10 Computer Group, 575 Science Drive Madison, WI, USA) or by inspection and the best 
alignment (i.e. resulting in the highest percentage homology over the comparison window) 
generated by any of the various methods selected. Reference also may be made to the 
BLAST family of programs as, for example, disclosed by Altschul et al (Nucl Acids Res. 
25: 3389-3402, 1997). A detailed discussion of sequence analysis can be found in Unit 

15 193 of Ausubel et al. ("Current Protocols in Molecular Biology" John Wiley & Sons Inc, 
1994-1998, Chapter 15, 1998). 

The terms "sequence similarity" and "sequence identity" as used herein refers to the extent 
that sequences are identical or functionally or structurally similar on a nucleotide-by- 

20 nucleotide basis or an amino acid-by-amino acid basis over a window of comparison. 
Thus, a "percentage of sequence identity", for example, is calculated by comparing two 
optimally aligned sequences over the window of comparison, determining the number of 
positions at which the identical nucleic acid base (e.g. A, T, C, G, I) or the identical amino 
acid residue (e.g. Ala, Pro, Ser, Thr, Gly, Val, Leu, He, Phe, Tyr, Trp, Lys, Arg, His, Asp, 

25 Glu, Asn, Gin, Cys and Met) occurs in both sequences to yield the number of matched 
positions, dividing the number of matched positions by the total number of positions in the 
window of comparison (i.e., the window size), and multiplying the result by 100 to yield 
the percentage of sequence identity. For the purposes of the present invention, "sequence 
identity" will be understood to mean the "match percentage" calculated by the DNASIS 

30 computer program (Version 2.5 for windows; available from Hitachi Software engineering 
Co., Ltd., South San Francisco, California, USA) using standard defaults as used in the 
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reference manual accompanying the software. Similar comments apply in relation to 
sequence similarity. The term "similarity" is particularly useful to describe amino acid 
sequence comparisons. The term "identity" is particularly useful to describe nucleotide 
sequence comparisons. 

5 

Reference to at least about 75% identity or 75% similarity includes percentage identities 
and similarities greater than 75% such as 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 
83%, 84%, 85%, 86%, 87%, 88%,. 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 
98%, 99% and 100%. 

10 

Reference herein to a low stringency means from at least about 0 to at least about 15% 
(v/v) formamide (including 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10% 11%, 12%, 13% 
and 14% (v/v) formamide) and from at least about 1 M to at least about 2 M salt for 
hybridization, and at least about 1 M to at least about 2 M salt for washing conditions. 

15 Generally, low stringency is at from about 25-30°C to about 42°C. The temperature may 
be altered and higher temperatures used to replace formamide and/or to give alternative 
stringency conditions. Alternative stringency conditions may be applied where necessary, 
such as medium stringency, which includes and encompasses from at least about 16% (v/v) 
to at least about 30% (v/v) formamide and from at least about 0.5 M to at least about 0.9 M 

20 salt for hybridization, and at least about 0.5 M to at least about 0.9 M salt for washing 
conditions, or high stringency, which includes and encompasses from at least about 31% 
(v/v) to at least about 50% (v/v) formamide and from at least about 0.01 M to at least about 
0,15 M salt for hybridization, and at least about 0.01 M to at least about 0.15 M salt for 
washing conditions. In general, washing is carried out T m = 69.3 + 0.41 (G+C)% (Marmur 

25 and Doty, J. Mol Biol 5: 109, 1962). However, the T m of a duplex DNA decreases by 1°C 
with every increase of 1% in the number of mismatch base pairs (Bonner and Laskey, Eur. 
J. Biochem. 46: 83, 1974). Formamide is optional in these hybridization conditions. 
Accordingly, particularly preferred levels of stringency are defined as follows: low 
stringency is 6 x SSC buffer, 0.1% (w/v) SDS at 25-42°C; a moderate stringency is 2 x 

30 SSC buffer, 0.1% (w/v) SDS at a temperature in the range 20°C to 65°C; high stringency is 
0.1 x SSC buffer, 0.1% (w/v) SDS at a temperature of at least 65°C. 



WO 2004/094630 



PCT/AU2004/000524 



-34- 

A further aspect of the present invention contemplates a nucleic acid molecule or 
derivative, homolog or analog thereof comprising a nucleotide sequence substantially as 
set forth in SEQ ID NO:4 or SEQ ID NO:6 or a nucleotide sequence having at least 75% or 

5 greater similarity to SEQ ID NO:4 or SEQ ID NO:6 or a derivative, homolog or analog 
thereof, or capable of hybridizing to SEQ ID NO:4 or SEQ ID NO:6 or its complementary 
form under low stringency conditions and which encodes an amino acid sequence 
substantially as set forth in SEQ ID NO:2 or a derivative, homolog or mimetic thereof or 
having at least 75% or greater identity to SEQ ID NO:2 after optimal alignment which 

10 nucleic acid molecule encodes a chymotrypsin which is insensitive to a proteinaceous 
inhibitor of K alata. 

Yet another aspect of the present invention contemplates nucleic acid molecule comprising 
a sequence of nucleotides substantially as set forth in SEQ ID NO:4 or SEQ ID NO:6. The 
15 nucleic acid molecule encoding HpCh5 is preferably a sequence of deoxyribonucleic acids 
such as a cDNA sequence or a genomic sequence. A genomic sequence may also comprise 
exons or introns. A genomic sequence may also include a promoter region or other 
regulatory regions. The present invention further contemplates isolated introns and exons 
of HpF5 such as those involved in genetic networking within a plant cell. 

20 

The nucleic acid molecule according to this aspect of the invention corresponds herein to 
HpF5. This cDNA has been determined, in accordance with the present invention, to 
encode a protein that defines a new family of chymotrypsins, family 5, within the group of 
chymotrypsin gene families, and this protein is referred to herein as HpChS. Reference to 

25 "HpF5" also includes a genomic form of the gene. Within the Helicoverpa punctigera 
chymotrypsin gene families, there are varying levels of homology as shown in Table 2. 
Family 5 is exemplified by HpF5, and is most similar to family 2A at 73% and least 
similar to family 4 at <20%. Without intending to limit the instant invention in any way, 
HpChS is exemplified by two unique stretches of sequence in the N-terminal (Fl and F2) 

30 and by six amino acid substitutions relative to NaPI sensitive chymotrypsins. Five of these 
substitutions did not appear to fall into functionally significant regions, whereas the sixth 
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substitution is associated with one of the P-strands that forms a wall of the primary 
substrate-binding pocket. The location of this substitution and conversion to an arginine, 
from glutamine, is highly unusual for the SI domain that is predominantly lined with non- 
polar residues that define chymotrypsin specificity 

5 



TABLE 2 

1 0 Percentage protein sequence identity between members of the H. punctigera 

chmotrypsin gene family 



H. punctigera 
chymotrypsin 
families 
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73 
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<20 


72 


HpCh4AI 




<20 



15 The present invention provides, therefore, an isolated protein having chymotrypsin activity 
which is not substantially inhibited by a PI from N. alata. Accordingly, another aspect of 
the present invention is directed to an isolated protein selected from the list consisting of: 

(i) a novel chymotrypsin protein or a derivative, homolog or mimetic thereof wherein 
20 said chymotrypsin is insensitive to the proteinase inhibitors of N. alata; 

(ii) a protein having an amino acid sequence substantially as set forth in SEQ ID NO:2 
or a derivative, homolog or mimetic thereof or having at least 75% or greater 
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identity to SEQ ID NO:2 after optimal alignment; 

(iii) a protein encoded by a sequence of nucleotides substantially as set forth in SEQ ID 
NO:4 or SEQ ID NO:6 or a derivative, homolog or analog thereof or a nucleotide 
5 sequence having at least 75% similarity to SEQ ID NO:4 or SEQ ID NO:6 or a 

sequence encoding an amino acid sequence substantially as set forth in SEQ ID 
NO:2 or a derivative, homolog or mimetic thereof or having at least 75% or greater 
similarity to SEQ ID NO:2 after optimal alignment; 

10 (iv) a protein encoded by a nucleic acid molecule or derivative, homolog or analog 
thereof comprising a nucleotide sequence substantially as set forth in SEQ ID NO:4 
or SEQ ID NO:6 or a derivative, homolog or analog thereof, or capable of 
hybridizing to SEQ ID NO:4 or SEQ ID NO:6 under low stringency conditions and 
which encodes an amino acid sequence substantially as set forth in SEQ ID NO:2 

15 or a derivative, homolog or mimetic thereof or having at least 75% or greater 

similarity to SEQ ID NO:2 after optimal alignment. 

(v) a novel chymotrypsin protein or a derivative, homolog or mimetic thereof that has 
an arginine substituted for an asparagine or glutamine in the primary substrate- 
20 binding pocket. 

The present invention discloses the amino acid, and corresponding cDNA sequence of a 
novel chymotrypsin that is insensitive to the Type II serine proteinase inhibitors produced 
by solanaceous species such as N. alata Therefore, this may be used as a target for agents 
25 to control insects carrying this insensitive proteinase. A number of compounds have been 
shown to inhibit the activity of HpChS, and a list of these compounds as preferred 
embodiments is found in Table 3. 
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TABLE3 

Effect of various proteinase inhibitors on the activity of the 
NaPI- insensitive chymotrypsins from H. punctigera and bovine chymotrypsin 
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5 



SBTI, soybean trypsin inhibitor; PMSF, phenylmethyl sulphonyl fluoride; 
Bowman Birk, soybean Bowman Birk inhibitor; lima bean, lima bean 
trypsin inhibitor (Sigma); Pot I, potato proteinase inhibitor Type I. Bovine 
chymotrypsin (BC). 

10 

Without limiting the mode of action of any of these compounds to any one activity, 3-D 
modeling is used to investigate the binding ability of PotI and the peptide encoded by the 
CI domain of NaPI to both NaPI-insensitive and NaPI-sensitive insect chymotrypsins. 

15 The deduced amino acid sequences from the cDNA clones HpF2A (NaPI-sensitive) and 
HpF5 (NaPI-insensitive) were modeled on the structures of the Solenopsis imicta (fire ant) 
and Bos taunts (cow) chymotrypsins obtained from the Research Collaborator for 
Structural Bioinformatics (RCSB) Protein Data Bank. 

The Helicoverpa chymotrypsins are predicted to adopt similar structures to those reported 
20 for all the chymotrypsin structures available in the data bank. The modeled structures have 
the classic serine protease fold consisting of two, six-stranded anti-parallel beta barrels 
with the catalytic triad located between the two domains. Certain surface loops are cleaved 
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in the mammalian chymotrypsins (loop 142), but remain intact within insect 
chymotrypsins. Therefore, the fire ant chymotrypsin structure (Botos et al, J. Mol. Biol. 
298: 895-901, 2000 ) was required to help refine the orientation of these surface loops in 
the Helicoverpa chymotrypsin models. Two surface loops, 60 and 142 are considerably 
5 larger in the H. punctigera chymotrypsins (Figures 15,16). 

CI was modeled in complex with NaPI-sensitive and NaPI-insensitive chymotrypsins to 
investigate what residues in the NaPI-insensitive chymotrypsin might be involved in the 
loss of inhibitor binding. The structure of the chymotrypsin inhibitor CI was previously 
10 determined by *H NMR (Nielson et al, 1994, supra) but has not been determined in a 
proteinase complex. Therefore the related proteinase inhibitor PCI-1 from Solarium 
tuberosum in complex with Proteinase B from Streptomyces griseus (Greenblatt et al, J. 
Mol. Biol. 205: 201, 1989) provided an appropriate basis guiding the alignment of the 
complexes. Energy minimization of the CI -chymotrypsin complexes revealed Argl92 
15 (chymotrypsin numbering system) as the only likely candidate to cause such resistance 
from a possible 24 putative contact residues. The NaPI-insensitive chymotrypsin 
containing the Argl92 (Greer nomenclature, Greer, Proteins 7:317-34, 1990. Figure 15) 
residue in complex with CI could not be properly energy minimized due to steric contacts 
between Argl92 and CI whereas the Glnl92 residue in the Na-PI sensitive chymotrypsin 
20 in complex with CI caused no such problems due to its much smaller size. Figure 17 
shows a close up view of the binding region surrounding Glnl92 in the Cl-HpF2A 
chymotrypsin complex. It is clear that Glnl92 is not in conflict with any regions on the 
inhibitor molecule. However, comparison to the cognate Arg residue in the Cl-HpCh5 
chymotrypsin model demonstrates there is not enough space to accommodate this much 
25 larger residue (Figure 18) making contact with Thr5 and Ala9 in CI. Furthermore, 
modeling the StPotlA inhibitor into HpCh5 revealed that the NaPI-insensitive 
chymotrypsin could accommodate the Argl92 residue consistent with the inhibition of this 
chymotrypsin by StPotlA (Figure 19). 

30 In summary, the NaPI-insensitive chymotrypsin from Helicoverpa species has an arginine 
in place of an asparagine or glutamine at position 192 that extends into the SI binding 
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pocket and appears to interfere with CI binding. Furthermore, it is clear that this Arg 
residue does not interfere with PotI binding, consistent with the observation that PotI is a 
much more efficient inhibitor of insect chymotrypsins than the NaPI inhibitors. Large 
quantities of the PotI inhibitor were purified from potato tubers (Figure 22) to evaluate the 
5 combined effect of NaPI and PotI on the growth of H. armigera larvae (Figure 23). 
Bioassays confirmed that PotI significantly enhances the activity of the NaPI inhibitors. 
Caterpillars fed NaPI and PotI in combination (0.26 and 0.34% (w/v), respectively) were 
34% the size of control larvae at the fifth instar stage of development whereas caterpillars 
feeding on NaPIs alone were about 84% the size of the controls. 

10 

Therefore, another aspect of the present invention provides a method for modulating 
activity of the HpChS or a homolog or variant thereof in an insect, said method comprising 
contacting the HpChS protein or its homolog or variant with an effective amount of an 
agent for a time and under conditions sufficient to decrease or increase HpChS activity. 

15 

Yet another aspect of the present invention provides a method for modulating expression 
of HpF5 or homolog or variant in an insect, said method comprising contacting HpF5 or its 
homolog or variant with an effective amount of an agent for a time and under conditions 
sufficient to decrease or increase HpF5 expression. . 

20 

The preferred insects targeted in accordance with these and other aspects are species of 
Helicoverpa and other Lepidopteran species. In addition, plants to be protected include 
those sensitive to K armigera, K punctigera, H. zea and H. virescens. Such plants include 
the H. armigera sensitive plants such as cotton, sweet corn, tomato, tobacco, pimento, 
25 potato, sunflower, citrus, plums, sorghum, leeks, soybean, alfalfa, beans, pidgeon peas, 
chick peas, artichokes, curcurbits, lettuce, Dianthus (an ornamental plant), geraniums, cape 
gooseberry, maize, flax and linseed, alfalfa, lupins, broad beans, garden peas, peanuts, 
canola, snapdragons, cherry, sunflower, pot marigolds and Helichrysum (an ornamental 
plant) 

30 

Other plants contemplated herein include cereals (such as wheat, barley, oats, triticale, 
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etc), horticultural plants (e.g. apples, carrots, onions, etc.), ornamental plants (such as 
orchids, roses, petunias, etc.) and trees. 

The present invention contemplates, therefore, methods of screening for compounds which 
5 inhibit or act as antagonists of HpCh5 activity or HpF5 gene expression. For example, one 
method involves contacting a candidate compound with HpChS. The screening procedure 
includes assaying (i) for the presence of a complex between the compound and HpChS, or 
(ii) an alteration in the expression levels of HpF5 cDNA or genomic DNA. One form of 
assay involves competitive binding assays. In such competitive binding assays, HpCh5 is 

10 typically labeled. Free HpChS is separated from any putative complex and the amount of 
free (i.e. uncomplexed) label is a measure of the binding of the agent being tested to bind 
to HpChS. One may also measure the amount of bound, rather than free, HpChS. It is also 
possible to label the compound rather than HpChS and to measure the amount of 
compound binding to target in the presence and in the absence of the compound being 

15 tested. Such compounds may inhibit HpChS. A similar approach may be adopted for 
compounds which bind to and inhibit HpF5 or mRNA transcripts thereof. 

Another technique for agent screening provides high throughput screening for compounds 
having suitable binding affinity to HpChS and is described in detail in Geysen 

20 (International Patent Publication No. WO 84/03564). Briefly stated, large numbers of 
different small peptide test compounds are synthesized on a solid substrate, such as plastic 
pins or some other surface. The peptide test compounds are reacted with HpChS and 
washed. Bound HpChS molecules are then detected by methods well known in the art. This 
method may be adapted for screening for non-peptide, chemical entities. This aspect, 

25 therefore, extends to combinatorial approaches including phage display to screen for 
HpChS antagonists. 



Purified HpChS can be coated directly onto plates for use in the aforementioned agent 
screening techniques. However, non-neutralizing antibodies to HpChS may also be used to 
30 immobilize HpChS on the solid phase. 



WO 2004/094630 



PCT/AU2004/000524 



-41 - 

Live animals such as K armigera and/or H. puncttgera may also be used in feeding trials 
to find potential inhibitors. 

The present invention also contemplates the use of competitive agent screening assays in 
5 which neutralizing antibodies capable of specifically binding HpCh5 compete with a test 
compound for binding to HpChS or fragments thereof. In this manner, the antibodies can 
be used to detect the presence of any peptide which shares one or more antigenic 
determinants of HpChS. 

10 Yet another useful source of analogs of compounds which are chemically modified may be 
used to induce feed-back inhibition of biochemical or genetic pathways for generating 
authentic HpChS. 

Analogs of HpChS contemplated herein include but are not limited to modification to side 
15 chains, incorporating of unnatural amino acids and/or their derivatives during peptide, 
polypeptide or protein synthesis and the use of crosslinkers and other methods which 
impose conformational constraints on HpCh5. 

Examples of side chain modifications of HpCh5 contemplated by the present invention 
20 include modifications of amino groups such as by reductive alkylation by reaction with an 

aldehyde followed by reduction with NaBfLj; amidination with methylacetimidate; 

acylation with acetic anhydride; carbamoylation of amino groups with cyanate; 

trinitrobenzylation of amino groups with 2, 4, 6-trinitrobenzene sulphonic acid (TNBS); 

acylation of amino groups with succinic anhydride and tetrahydrophthalic anhydride; and 
25 pyridoxylation of lysine with pyridoxal-5-phosphate followed by reduction with NaBKL,. 

The guanidine group of arginine residues may be modified by the formation of 
heterocyclic condensation products with reagents such as 2,3-butanedione, phenylglyoxal 
and glyoxal. 
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The carboxyl group may be modified by carbodiimide activation via O-acylisourea 
formation followed by subsequent derivitization, for example, to a corresponding amide. 

Sulphydryl groups may be modified by methods such as carboxymethylation with 
5 iodoacetic acid or iodoacetamide; performic acid oxidation to cysteic acid; formation of a 
mixed disulphides with other thiol compounds; reaction with maleimide, maleic anhydride 
or other substituted maleimide; formation of mercurial derivatives using 4- 
chloromercuribenzoate, 4-chloromercuriphenylsulphonic acid, phenylmercury chloride, 2- 
chloromercuri-4-nitrophenol and other mercurials; carbamoylation with cyanate at alkaline 
10 pH. 

Tryptophan residues may be modified by, for example, oxidation with N- 
bromosuccinimide or alkylation of the indole ring with 2-hydroxy-5-nitrobenzyl bromide 
or sulphonyl halides. Tyrosine residues on the other hand, may be altered by nitration with 
1 5 tetranitromethane to form a 3-nitrotyrosine derivative. 

Modification of the imidazole ring of a histidine residue may be accomplished by 
alkylation with iodoacetic acid derivatives or N-carbethoxylation with 
diethylpyrocarbonate. 

20 

Examples of incorporating unnatural amino acids and derivatives during peptide synthesis 
include, but are not limited to, use of norleucine, 4-amino butyric acid, 4-amino-3- 
hydroxy-5-phenylpentanoic acid, 6-aminohexanoic acid, t-butylglycine, norvaline, 
phenylglycine, ornithine, sarcosine, 4-amino-3-hydroxy-6-methylheptanoic acid, 2-thienyl 
25 alanine and/or D-isomers of amino acids. A list of unnatural amino acid, contemplated 
herein is shown in Table 4. 
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TABLE4 

Codes for non-conventional amino acids 



5 


Non-conventional 


Code 


Non-conventional 


Code 




amino acid 




amino acid 






a-aminobutyric acid 


Abu 


L-N-methylalanine 


Nmala 


10 


a-amino-a-methylbutyrate 


Mgabu 


L-N-methylarginine 


Nmarg 




aminocyclopropane- 


Cpro 


L-N-methylasparagine 


Nmasn 




carboxylate 




L-N-methylaspartic acid 


Nmasp 




aminoisobutyric acid 


Aib 


L-N-methylcysteine 


Nmcys 




aminonorbornyl- 


Norb 


L-N-methylglutamine 


Nmgln 


15 


carboxylate 




L-N-methylglutamic acid 


Nmglu 




cyclohexylalanine 


Chexa 


L-Nmethylhistidine 


Nmhis 




cyclopentylalanine 


Cpen 


L-N-methylisolleucine 


Nmile 




D-alanine 


Dal 


L-N-methylleucine 


Nmleu 




D-arginine 


Darg 


L-N-methyllysine 


Nmlys 


20 


D-aspartic acid 


Dasp 


L-N-methylmethionine 


Nmmet 




D-cysteine 


Dcys 


L-N-methylnorleucine 


Nmnle 




D-glutamine 


Dgln 


L-N-methylnorvaline 


Nmnva 




D~erlutamic acid 


Dglu 


L-N-methylornithine 


Nmom 




D-histidine 


Dhis 


L-N-methylphenylalanine 


Nmphe 


25 


D-isoleucine 


Dile 


L-N-methylproline 


Nmpro 




D-leucine 


Dleu 


L-N-methylserine 


Nmser 




D-lysine 


Dlys 


L-N-methylthreonine 


Nmthr 




D-methionine 


Dmet 


L-N-methyltxyptophan 


Nmtrp 




D-ornithine 


Dorn 


L-N-methyltyrosine 


Nmtyr 


30 


D-phenylalanine 


Dphe 


L-N-methylvaline 


Nmval 




D-proline 


Dpro 


L-N-methylethylglycine 


Nmetg 




D-serine 


Dser 


L-N-methyl-t-butylglycine 


Nmtbug 
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D-threonine 


Dthr 


L-norleucine 


Nle 




D-tryptophan 


Dtrp 


L-norvaline 


Nva 




D-tyrosine 


Dtyr 


a-methyl-aminoisobutyrate 


Maib 




D-valine 


Dval 


a-methyl-y-aminobutyrate 


Mgabu 


5 


D-a-methylalanine 


Dmala 


a-methylcyclohexylalanine 


Mchexa 




D-a-methylarginine 


Dmarg 


a-methylcylcopentylalanine 


Mcpen 




D-a-methylasparagine 


Dmasn 


a-methyl-a-napthylalanine 


Manap 




D-a-methylaspartate 


Dmasp 


a-methylpenicillamine 


Mpen 




D-a-methylcysteine 


Dmcys 


N-(4-aminobutyl)glycine 


Nglu 


10 


D-a-methylglutamine 


Dmgln 


N-(2-aminoethyl)glycine 


Naeg 




D-a-methylhistidine 


Dmhis 


N-(3-aminopropyl)glycine 


Norn 




D-a-methylisoleucine 


Dmile 


N-amino-a-methylbutyrate 


Nmaabu 




D-a-methylleucine 


Dmleu 


a-napthylalanine 


Anap 




D-a-methyllysine 


Dmlys 


N-benzylglycine 


Nphe 


15 


D-a-methylmethionine 


Dmmet 


N-(2-carbamylethyl)glycine 


Ngln 




D-a-methylornithine 


Dmorn 


N-(carbamylmethyl)glycine 


Nasn 




D-a-methylphenylalanine 


Dmphe 


N-(2-carboxyethyl)glycine 


Nglu 




D-a-methylproline 


Dmpro 


N~(carboxymethyl)glycine 


Nasp 




D-a-methylserine 


Dmser 


N-cyclobutylglycine 


Ncbut 


20 


D-a-methylthreonine 


Dmthr 


N-cycloheptylglycine 


Nchep 




D-a-methyltryptophan 


Dmtrp 


N-cyclohexylglycine 


Nchex 




D-a-methyltyrosine 


Dmty 


N-cyclodecylglycine 


Ncdec 




D-a-methylvaline 


Dmval 


N-cylcododecylglycine 


Ncdod 




D-N-m eth vl al anine 


Dnmala 


N-cyclooctylglycine 


Ncoct 


25 


D-N-methylarginine 


Dnmarg 


N-cyclopropylglycine 


Ncpro 




D-N-methylasparagine 


Dnmasn 


N-cycloundecylglycine 


Ncund 




D-N-methylaspartate 


Dnmasp 


N-(2,2-diphenylethyl)glycine 


Nbhm 




D-N-methylcysteine 


Dnmcys 


N-(3,3-diphenylpropyl)glycine 


Nbhe 




D-N-methylglutamine 


Dnmgln 


N-(3-guanidinopropyl)glycine 


Narg 


30 


D-N-methylglutamate 


Dnmglu 


N-(l-hydroxyethyl)glycine 


Nthr 
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D-N-methylhistidine 


Dnmhis 


N-(hydroxyethyl))glycine 


Nser 




D-N-methylisoleucine 


Dnmile 


N-(imidazolylethyl))glycine 


Nhis 




D-N-methylleucine 


Dnmleu 


N-(3-indolylyethyl)glycine 


Nhtrp 




D-N-methyllysine 


Dnmlys 


N-methyl-y-aminobutyrate 


Nmgabu 


5 


N-methylcyclohexylalanine Nmchexa 


D-N-methylmethionine 


Dnmmet 




D-N-methylornithine 


Dnmorn 


N-methylcyclopentylalanine 


Nmcpen 




N-methylglycine 


Nala 


D-N-methylphenylalanine 


Dnmphe 




N-methylaminoisobutyrate 


Nmaib 


D-N-methylproline 


Dnmpro 




N-(l -methylpropyl)glycine 


Nile 


D-N-methylserine 


Dnmser 


10 


N-(2-methylpropyl)glycine 


Nleu 


D-N-methylthreonine 


Dnmthr 




D-N-methyltryptophan 


Dnmtrp 


N-(l -methylethyl)glycine 


Nval 




D-N-methyltyrosine 


Dnmtyr 


N-methyla-napthylalanine 


Nmanap 




D-N-methylvaline 


Dnmval 


N-methylpenicillamine 


Nmpen 




y-aminobutyric acid 


Gabu 


N-(p-hydroxyphenyl)glycine 


Nhtyr 


15 


L-f-butylglycine 


Tbug 


N-(thiomethyl)glycine 


Ncys 




L-ethylglycine 


Etg 


penicillamine 


Pen 




L-homophenylalanine 


Hphe 


L-a-methylalanine 


Mala 




L-oc-methylarginine 


Marg 


L-oc-methylasparagine 


Masn 




L-a-methylaspartate 


Masp 


L-a-methyl-r-butylglycine 


Mtbug 


20 


L-a-methylcysteine 


Mcys 


L-methylethylglycine 


Metg 




L-a-methylglutamine 


Mgln 


L-a-methylglutamate 


Mglu 




L-a-methylhistidine 


Mhis 


L-a-methylhomophenylalanine 


Mhphe 




L-a-methylisoleucine 


Mile 


N-(2-methylthioethyl)glycine 


Nmet 




L-a-methylleucine 


Mleu 


L-a-methyllysine 


Mlys 


25 


L-a-methylmethionine 


Mmet 


L-a-methylnorleucine 


Mnle 




L-a-methylnorvaline 


Mnva 


L-a-methyloniithine 


Morn 




L-a-methylphenylalanine 


Mphe 


L-a-methylproline 


Mpro 




L-a-methylserine 


Mser 


L-a-methylthreonine 


Mthr 




L-a-methyltryptophan 


Mtrp 


L-a-methyltyrosine 


Mtyr 


30 


L-oc-methylvaline 


Mval 


L-N-methylhomophenylalanine 


Nmhphe 
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N-(N-(2,2-diphenylethyi) Nnbhm 
carbamylmethyl)glycine 
l-carboxy-l-(2,2-diphenyl- Nmbc 
ethylamino)cyclopropane 



N-(N-(3,3-diphenylpropyl) 
carbamylmethyl)glycine 



Nnbhe 



5 



Crosslinkers can be used, for example, to stabilize 3D conformations, using homo- 
Afunctional crosslinkers such as the Afunctional imido esters having (CH 2 ) n spacer groups 
with n=l to n=6, glutaraldehyde, N-hydroxysuccinimide esters and hetero-bifunctional 

10 reagents which usually contain an amino-reactive moiety such as N-hydroxysuccinimide 
and another group specific-reactive moiety such as maleimido or dithio moiety (SH) or 
carbodiimide (COOH). In addition, peptides can be conformationally constrained by, for 
example, incorporation of C a and N a -methylamino acids, introduction of double bonds 
between C a and Cp atoms of amino acids and the formation of cyclic peptides or analogs 

15 by introducing covalent bonds such as forming an amide bond between the N and C 
termini, between two side chains or between a side chain and the N or C terminus. 

Such analogs, especially if they retain activity or even the HpCL5 molecule itself may 
have indistinct applications such as in washing powder or as in a stain removal 
20 formulation. 

Another aspect of the present invention contemplates any compound which binds or 
otherwise interacts with HpChS or its derivatives or variants or which induces feed-back 
inhibition of HpCh5 synthesis resulting in down-regulation of HpCh5 activity or levels. 



The present invention is also useful for screening for other compounds which reduce 
expression of HpF5. A variety of agent screening techniques may be employed such as 
those described herein and in International Publication No. WO 97/02048. 

30 A compound antagonist includes a variant of HpChS such as a variant comprising an 
analog amino acid residue as indicated above. In one embodiment, the target is the HpChS 



25 
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polypeptide. The term "polypeptide" refers to a polymer of amino acids and its equivalent 
and does not refer to a specific length of the product, thus, peptides, oligopeptides and 
proteins are included within the definition of a polypeptide. This term also does not 
exclude modifications of the polypeptide, for example, glycosylates, aceylations, 
5 phosphorylations and the like. Included within the definition are, for example, 
polypeptides containing one or more analogs of an amino acid (including, for example, 
unnatural amino acids such as those given in Table 4) or polypeptides with substituted 
linkages. 

1 0 A substance identified as an antagonist of HpChS function or HpF5 gene activity may be a 
peptide or non-peptide in nature. Non-peptide "small molecules" are often preferred for 
many agricultural or horticultural purposes due to their perceived stability. 

There are several steps commonly taken in the design of a mimetic-type antagonist of 
15 HpChS from a compound. First, the particular parts of the compound that are critical 
and/or important in determining the target property are determined. In the case of a 
peptide, this can be done by systematically varying the amino acid residues in the peptide, 
e.g. by substituting each residue in turn. Alanine scans of peptides are commonly used to 
refine such peptide motifs. These parts or residues constituting the active region of the 
20 compound are known as its "agrichemicaphore". 

Once the agrichemicaphore has been found, its structure is modeled according to its 
physical properties, e.g. stereochemistry, bonding, size and/or charge, using data from a 
range of sources, e.g. spectroscopic techniques, x-ray diffraction data and NMR. 
25 Computational analysis, similarity mapping (which models the charge and/or volume of a 
agrichemicaphore, rather than the bonding between atoms) and other techniques can be 
used in this modeling process. 

In a variant of this approach, the three-dimensional structure of HpCh5 and a compound 
30 binding it. This can be especially useful where HpChS or its antagonist change 
conformation on binding, allowing the model to take account of this in the design of the 
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mimetic. Modeling can be used to generate inhibitors which interact with the linear 
sequence or a three-dimensional configuration. 

A template molecule is then selected onto which chemical groups which mimic the 
5 pharmacophore can be grafted. The template molecule and the chemical groups grafted 
onto it can conveniently be selected so that the agrichemicaphore is easy to synthesize and 
is likely to be agriculturally or horticulturally acceptable. Alternatively, where the 
agrichemicaphore is peptide-based, further stability can be achieved by cyclizing the 
peptide, increasing its rigidity. The agrichemicaphore or agrichemicaphores found by this 
10 approach can then be screened to see whether they have HpChS antagonistic property, or to 
what extent they exhibit it. Further optimization or modification can then be carried out to 
arrive at one or more final agents for testing. 

Yet another aspect of the present invention provides a method for detecting an agent 
15 capable of binding or otherwise associating with a HpCh5 binding site or functional 
equivalent thereof said method involving the use of in-silico 3-D modeling to identify 
compounds that bind to HpCh5 and specifically, are not interfered with by Arginine 192. 

The goal of rational HpCh5 antagonist design is to produce structural analogs of HpCh5 or 
20 of small molecules with which HpChS interacts (e.g. an antagonist or inhibitor) in order to 
fashion agents which are, for example, more inhibitory of HpCh5. See, e.g. Hodgson 
{Bio/Technology 9: 19-21, 1991). In one approach, one first determines the three- 
dimensional structure of HpCh5 by x-ray crystallography, by computer modeling or, most 
typically, by a combination of approaches. Useful information regarding the structure of a 
25 polypeptide may also be gained by modeling based on the structure of chymotrypsins. An 
example of rational drug design is the development of HIV protease inhibitors (Erickson et 
dL % Science 249: 527-533, 1990). In addition, target molecules may be analyzed by an 
alanine scan (Wells, Methods Enzymol 202: 2699-2705, 1991). In this technique, an amino 
acid residue is replaced by Ala and its effect on the peptide's activity is determined. Each 
30 of the amino acid residues of the peptide is analyzed in this manner to determine the 
important regions of the peptide. 
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In addition, compounds including antagonists may be directed to particular locations or 
regions or domains or HpCh5. 

5 It is also possible to isolate a HpCh5 -specific antibody and then to solve its crystal 
structure. In principle, this approach yields an agricore upon which subsequent agent 
design can be based. It is possible to bypass protein crystallography altogether by 
generating anti-idiotypic antibodies (anti-ids) to a functional antibody. As a mirror image 
of a mirror image, the binding site of the anti-ids is expected to be an analog of the original 
10 receptor. The anti-id could then be used to identify and isolate peptides from banks of 
chemically or biologically produced banks of peptides. Selected peptides would then act as 
the agrichemicaphore. 

Chymotrypsin clone HpF2B is expressed in E. coli fused to a six histidine (6.H) tag at the 
15 C-terminus and is purified to homogeneity on Talon metal affinity resin (Figure 20) for 
injection into a rabbit for production of polyclonal antibodies. N-terminal sequencing of 
the purified product confirmed the expression of the chymotrypsin HpCh2B. After the 
fourth boost with antigen, the serum is collected and tested on protein blots of bacterially 
expressed protein and unfractionated gut extracts. The antibody detected the full-length 
20 recombinant chymotrypsin at a dilution of 1 in 2500 as well as several break-down 
products. Unfractionated gut extract and a sample of protein bound to the CI affinity 
column were also stained with the anti-HpCh2B antibody which detected the mature native 
form of the enzyme. 

25 Purified 6H.HpCh2B is used to test the detection limit of anti-HpCh2B antibody by 
comparison of immunoblots to silver stained SDS-PAGE gels. The antibody detected 20 
ng of bacterially expressed chymotrypsinogen and also recognized the mature form of the 
native chymotrypsin isolated from gut of K punctigera. 

30 The cDNA (HpF5) encoding the NaPI-insensitive chymotrypsin (HpChS) is expressed in 
E. coli in a similar manner except the six-histidine tag is fused to the N-terminus of the 
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expresseci protein. The polyclonal antiserum that is raised against the bacterially expressed 
chymotrypsin HpCh2B did not cross react with bacterially expressed NaPI- insensitive 
chymotrypsin (HpCh5) on protein blots (Figure 21). Likewise the antiserum raised against 
HpCh5 did not bind to HpCh2B. This indicates that these antisera can be used to 
5 specifically distinguish between and monitor levels of the NaPI -insensitive and sensitive 
chymotrypsins in unfractionated gut extracts 

Accordingly, still another aspect of the present invention is directed to antibodies to 
HpChS and HpCh2B including catalytic antibodies. 

10 

In another aspect of the present invention, a method is provided for the isolation of and 
separation of individual isoforms of chymotrypsin, said method consisting of: 

(i) affinity chromatography of insect gut extracts initially with benzamidine-sepharose 
15 to bind trypsins; 

(ii) further affinity chromatography of the unbound proteins using immobilized N. 
alata serine proteinase inhibitor CI to bind all NaPI inhibitable chymotypsins; and 

20 (iii) affinity chromatography of the eluate from (ii) with immobilized PotI and Potll or 
chymostatin to bind the remainder. The putative NaPI-insensitive chymotrypsins 
are then eluted with 8 M urea. 

The present invention extends to a genetic approach to down-regulating expression of an 
25 HpF5 or its homologs or variants. Such an approach uses nucleic acid molecules or 
molecules having a genetic component (e.g. RNAi) to induce pre- or post-transcriptional 
gene silencing. 

The terms "nucleic acids", "nucleotide" and "polynucleotide" include RNA, cDNA, 
30 genomic DNA, synthetic forms and mixed polymers, both sense and antisense strands, and 
may be chemically or biochemically modified or may contain non-natural or derivatized 
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nucleotide bases, as will be readily appreciated by those skilled in the art. Such 
modifications include, for example, labels, methylation, substitution of one or more of the 
naturally occurring nucleotides with an analog (such as the morpholine ring), 
internucleotide modifications such as uncharged linkages (e.g. methyl phosphonates, 

5 phosphotriesters, phosphoamidates, carbamates, etc.), charged linkages (e.g. 
phosphorothioates, phosphorodithioates, etc.), pendent moieties (e.g. polypeptides), 
intercalators (e.g. acridine, psoralen, etc.), chelators, alkylators and modified linkages (e.g. 
a-anomeric nucleic acids, etc.). Also included are synthetic molecules that mimic 
polynucleotides in their ability to bind to a designated sequence via hydrogen binding and 

10 other chemical interactions. Such molecules are known in the art and include, for example, 
those in which peptide linkages substitute for phosphate linkages in the backbone of the 
molecule. 

Antisense polynucleotide sequences, for example, are useful in silencing transcripts of 
15 HpF5, Furthermore, polynucleotide vectors containing all or a portion of HpF5 gene locus 
may be placed under the control of a promoter in either the sense or antisense orientation 
and introduced into a cell. Expression of such a sense or antisense construct within a cell 
interferes with target transcription and/or translation. Furthermore, co-suppression (i.e. 
using sense-suppresion) and mechanisms to induce RNAi or siRNA may also be 
20 employed. Alternatively, antisense or sense molecules may be directly administered. In 
this latter embodiment, the antisense or sense molecules may be formulated in a 
composition and then administered by any number of means to target cells. 

A variation on antisense and sense molecules involves the use of morpholinos, which are 
25 oligonucleotides composed of morpholine nucleotide derivatives and phosphorodiamidate 
linkages (for example, Summerton and Weller, Antisense and Nucleic Acid Drug 
Development 7: 187-195, 1997). Such compounds are injected into embryos and the effect 
of interference with mRNA is observed. 

30 In one embodiment, the present invention employs compounds such as oligonucleotides 
and similar species for use in modulating the function or effect of nucleic acid molecules 
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encoding HpChS, i.e. the oligonucleotides induce transcriptional or post-transcriptional 
gene silencing. This is accomplished by providing oligonucleotides which specifically 
hybridize with one or more nucleic acid molecules encoding the inhibitor. The 
oligonucleotides may be provided directly to a cell or generated within the cell. As used 

5 herein, the terms 'target nucleic acid" and "nucleic acid molecule encoding HpChS" have 
been used for convenience to encompass DNA encoding the inhibitor, RNA (including 
pre-mRNA and mRNA or portions thereof) transcribed from such DNA, and also cDNA 
derived from such RNA. The hybridization of a compound of the subject invention with its 
target nucleic acid is generally referred to as "antisense". Consequently, the preferred 

10 mechanism believed to be included in the practice of some preferred embodiments of the 
invention is referred to herein as "antisense inhibition." Such antisense inhibition is 
typically based upon hydrogen bonding-based hybridization of oligonucleotide strands or 
segments such that at least one strand or segment is cleaved, degraded, or otherwise 
rendered inoperable. In this regard, it is presently preferred to target specific nucleic acid 

1 5 molecules and their functions for such antisense inhibition. 

The functions of DNA to be interfered with can include replication and transcription. 
Replication and transcription, for example, can be from an endogenous cellular template, a 
vector, a plasmid construct or otherwise. The functions of RNA to be interfered with can 
20 include functions such as translocation of the RNA to a site of protein translation, 
translocation of the RNA to sites within the cell which are distant from the site of RNA 
synthesis, translation of protein from the RNA, splicing of the RNA to yield one or more 
RNA species, and catalytic activity or complex formation involving the RNA which may 
be engaged in or facilitated by the RNA. 

25 

In the context of this invention, "hybridization" means the pairing of complementary 
strands of oligomeric compounds. In the present invention, the preferred mechanism of 
pairing involves hydrogen bonding, which may be Watson-Crick, Hoogsteen or reversed 
Hoogsteen hydrogen bonding, between complementary nucleoside or nucleotide bases 
30 (nucleobases) of the strands of oligomeric compounds. For example, adenine and thymine 
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are complementary nucleobases which pair through the formation of hydrogen bonds. 
Hybridization can occur under varying circumstances. 

An antisense compound is specifically hybridizable when binding of the compound to the 
5 target nucleic acid interferes with the normal function of the target nucleic acid to cause a 
loss of activity, and there is a sufficient degree of complementarity to avoid non-specific 
binding of the antisense compound to non-target nucleic acid sequences under conditions 
in which specific binding is desired. 

10 "Complementary" as used herein, refers to the capacity for precise pairing between two 
nucleobases of an oligomeric compound. For example, if a nucleobase at a certain position 
of an oligonucleotide (an oligomeric compound), is capable of hydrogen bonding with a 
nucleobase at a certain position of a target nucleic acid, said target nucleic acid being a 
DNA, RNA, or oligonucleotide molecule, then the position of hydrogen bonding between 

15 the oligonucleotide and the target nucleic acid is considered to be a complementary 
position. The oligonucleotide and the further DNA, RNA, or oligonucleotide molecule are 
complementary to each other when a sufficient number of complementary positions in 
each molecule are occupied by nucleobases which can hydrogen bond with each other. 
Thus, "specifically hybridizable" and "complementary" are terms which are used to 

20 indicate a sufficient degree of precise pairing or complementarity over a sufficient number 
of nucleobases such that stable and specific binding occurs between the oligonucleotide 
and a target nucleic acid. 

According to the present invention, compounds include antisense oligomeric compounds, 
25 antisense oligonucleotides, ribozymes, external guide sequence (EGS) oligonucleotides, 
alternate splicers, primers, probes, and other oligomeric compounds which hybridize to at 
least a portion of the target nucleic acid. As such, these compounds may be introduced in 
the form of single-stranded, double-stranded, circular or hairpin oligomeric compounds 
and may contain structural elements such as internal or terminal bulges or loops. Once 
30 introduced to a system, the compounds of the invention may elicit the action of one or 
more enzymes or structural proteins to effect modification of the target nucleic acid. One 
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non-limiting example of such an enzyme is RNAse H, a cellular endonuclease which 
cleaves the RNA strand of an RNA:DNA duplex. It is known in the art that single-stranded 
antisense compounds which are "DNA-like" elicit RNAse H. Activation of RNase H, 
therefore, results in cleavage of the RNA target, thereby greatly enhancing the efficiency 
5 of oligonucleotide-mediated inhibition of gene expression. Similar roles have been 
postulated for other ribonucleases such as those in the RNase III and ribonuclease L family 
of enzymes. 

While the preferred form of antisense compound is a single-stranded antisense 
10 oligonucleotide, in many species the introduction of double-stranded structures, such as 
double-stranded RNA (dsRNA) molecules, has been shown to induce potent and specific 
antisense-mediated reduction of the function of a gene or its associated gene products. 

In the context of the subject invention, the term "oligomeric compound" refers to a 
15 polymer or oligomer comprising a plurality of monomeric units. In the context of this 
invention, the term "oligonucleotide" refers to an oligomer or polymer of ribonucleic acid 
(RNA) or deoxyribonucleic acid (DNA) or mimetics, chimeras, analogs and homologs 
thereof. This term includes oligonucleotides composed of naturally occurring nucleobases, 
sugars and covalent internucleoside (backbone) linkages as well as oligonucleotides having 
20 non-naturally occurring portions which function similarly. Such modified or substituted 
oligonucleotides are often preferred over native forms because of desirable properties such 
as, for example, enhanced cellular uptake, enhanced affinity for a target nucleic acid and 
increased stability in the presence of nucleases. 

25 While oligonucleotides are a preferred form of the compounds of this invention, the 
present invention comprehends other families of compounds as well, including but not 
limited to oligonucleotide analogs and mimetics such as those herein described. 

The open reading frame (ORF) or "coding region" which is known in the art to refer to the 
30 region between the translation initiation codon and the translation termination codon, is a 
region which may be effectively targeted. Within the context of the present invention, one 
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region is the intragenic region encompassing the translation initiation or termination codon 
of the open reading frame (ORF) of a gene. 

Other target regions include the 5' untranslated region (5'UTR), known in the art to refer 
5 to the portion of an mRNA in the 5' direction from the translation initiation codon, and 
thus including nucleotides between the 5' cap site and the translation initiation codon of an 
mRNA (or corresponding nucleotides on the gene), and the 3' untranslated region 
(3'UTR), known in the art to refer to the portion of an mRNA in the 3 5 direction from the 
translation termination codon, and thus including nucleotides between the translation 
10 termination codon and 3' end of an mRNA (or corresponding nucleotides on the gene). 
The 5' cap site of an mRNA comprises an N7-methylated guanosine residue joined to the 
5'-most residue of the mRNA via a 5'-5' triphosphate linkage. The 5' cap region of an 
mRNA is considered to include the 5* cap structure itself as well as the first 50 nucleotides 
adjacent to the cap site. It is also preferred to target the 5' cap region, 

15 

Although some eukaryotic mRNA transcripts are directly translated, many contain one or 
more regions, known as "introns", which are excised from a transcript before it is 
translated. The remaining (and, therefore, translated) regions are known as "exons" and are 
spliced together to form a continuous mRNA sequence. Targeting splice sites, i.e. intron- 

20 exon junctions or exon-intron junctions, may also be particularly useful in situations where 
aberrant splicing is implicated in disease, or where an overproduction of a particular splice 
product is implicated in disease. Aberrant fusion junctions due to rearrangements or 
deletions are also preferred target sites. mRNA transcripts produced via the process of 
splicing of two (or more) mRNAs from different gene sources are known as "fusion 

25 transcripts". It is also known that introns can be effectively targeted using antisense 
compounds targeted to, for example, DNA or pre-mRNA. 

As is known in the art, a nucleoside is a base-sugar combination. The base portion of the 
nucleoside is normally a heterocyclic base. The two most common classes of such 
30 heterocyclic bases are the purines and the pyrimidines. Nucleotides are nucleosides that 
further include a phosphate group covalently linked to the sugar portion of the nucleoside. 
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For those nucleosides that include a pentofuranosyl sugar, the phosphate group can be 
linked to either the 2\ V or 5' hydroxyl moiety of the sugar. In forming oligonucleotides, 
the phosphate groups covalently link adjacent nucleosides to one another to form a linear 
polymeric compound. In turn, the respective ends of this linear polymeric compound can 
5 be further joined to form a circular compound, however, linear compounds are generally 
preferred. In addition, linear compounds may have internal nucleobase complementarity 
and may, therefore, fold in a manner as to produce a fUlly or partially double-stranded 
compound. Within oligonucleotides, the phosphate groups are commonly referred to as 
forming the internucleoside backbone of the oligonucleotide. The normal linkage or 
10 backbone of RNA and DNA is a 3' to 5' phosphodiester linkage. 

For topical delivery of antisense compounds, these oligonucleotides may contain modified 
backbones or non-natural internucleoside linkages. As defined in this specification, 
oligonucleotides having modified backbones include those that retain a phosphorus atom in 
15 the backbone and those that do not have a phosphorus atom in the backbone. For the 
purposes of this specification, and as sometimes referenced in the art, modified 
oligonucleotides that do not have a phosphorus atom in their internucleoside backbone can 
also be considered to be oligonucleosides. 

20 Preferred modified oligonucleotide backbones containing a phosphorus atom therein 
include, for example, phosphorothioates, chiral phosphorothioates, phosphorodithioates, 
phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates 
including 3'-alkylene phosphonates, 5-alkylene phosphonates and chiral phosphonates, 
phosphinates, phosphoramidates including 3 '-amino phosphoramidate and 

25 aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, 
thionoalkylphosphotriesters, selenophosphates and boranophosphates having normal 3 4 -5' 
linkages, 2 , -5' linked analogs of these, and those having inverted polarity wherein one or 
more internucleotide linkages is a 3' to 3', 5' to 5' or 2' to T linkage. Preferred 
oligonucleotides having inverted polarity comprise a single 3' to 3' linkage at the 3 5 -most 

30 internucleotide linkage i.e. a single inverted nucleoside residue which may be abasic (the 
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nucleobase is missing or has a hydroxyl group in place thereof). Various salts, mixed salts 
and free acid forms are also included. 

In an alternative embodiment, genetic constructs including DNA 'Vaccines" are used to 
5 generate antisense or sense molecules in plant cells. Furthermore, many of the preferred 
features described above are appropriate for sense nucleic acid molecules. 

A further aspect of the present invention relates to a method for control of insect 
populations, said method comprising administering to insects an effective amount of an 
10 agent for a time and under conditions sufficient to inhibit the expression of HpF5 or 
sufficient to inhibit the activity of HpChS, wherein said modulation results in reduction of 
the biological fitness of said insects. 

In one preferred embodiment of the present invention, the agent is one that can bind to the 
15 primary substrate binding pocket of HpChS, and not be interfered with by the arginine 
residue found at position 192. 

Reference to a "reduction of biological fitness" should be understood to be changes in the 
insect including, but not limited to, changes in body mass and/or viability. Preferably, 
20 these changes are understood as reductions in both body mass and/or viability. A reduction 
in biological fitness, therefore, includes a reduction in their growth and development. 

In yet another aspect the present invention provides a method for detecting an agent 
capable of modulating the function of HpChS or functional equivalent or derivative 
25 thereof, said method comprising administering to an insect containing said HpCh5 or 
functional equivalent or derivative thereof with a putative agent and detecting an altered 
activity phenotype associated with modulation of function of HpChS or its functional 
equivalent or derivative. 



30 Reference to "administration" of the modulator to HpChS refers to delivery of the 
modulating agent in any convenient means. In the agricultural setting this is likely to 
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include, but not be limited to:- 

(i) the delivery of the agent as the active ingredient in a spray or powder formulation; 
or 

5 

(ii) the production of the agent either as a protein or the product of a metabolic 
pathway in a plant. 

In a preferred embodiment of the invention, the administration of the agent is via the 
10 introduction of a nucleic acid encoding said agent into a plant for subsequent expression 
and production of the agent. Another preferred embodiment is the formulation of a spray or 
powder with said agent as the active ingredient. 

"Introduction" of the agent is to be understood to cover all means known to those in the art 
15 of making genetic changes in a plant. These include, but are not limited to plant 
transformation methods such as particle bombardment, Agrobacterium-mediated 
transformation, electroporation and viral transfection; plant-breeding techniques and 
mutagenesis of native plant genes. 

20 Accordingly, in the context of the present invention, nucleic acid molecules encoding an 
antisense or sense form of HpF5 or encoding an inhibitor of HpCh5 activity or HpF5 
expression is operably linked to a promoter, generally in a vector or other suitable medium 
for introduction to a plant genome. Alternatively, an existing PI may be cloned and 
modified to render it active against HpCh5. 

25 

The present invention further provides a genetically modified plant comprising cells which 
are capable of producing an antagonist of HpCh5 or HpF5 gene expression. In one 
particularly useful example, cotton or other crop plants are engineered to produce PotI or a 
combination of PotI and NaPI. Reference herein to "PotI" and "NaPI" includes reference 
30 to derivatives, variants and homologs including modifications to one or more domains in 
PotI or NaPI. 
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Reference herein to a "promoter" is to be taken in its broadest context and includes the 
transcriptional regulatory sequences of a classical genomic gene, including the TATA box 
which is required for accurate transcription initiation, with or without a CCAAT box 
5 sequence and additional regulatory elements (i.e. upstream activating sequences, enhancers 
and silencers) which alter gene expression in response to developmental and/or external 
stimuli, or in a tissue-specific manner. A promoter is usually, but not necessarily, 
positioned upstream or 5', or a structural gene region, the expression of which it regulates. 
Furthermore, the regulatory elements comprising a promoter are usually positioned within 
10 2 kb of the start site of transcription of the gene. 

In the present context, the term "promoter" is also used to describe a synthetic or fusion 
molecule, or derivative which confers, activates or enhances expression of a nucleic acid 
molecule in a cell. 

15 

Preferred promoters may contain additional copies of one or more specific regulatory 
elements, to further enhance expression of the sense molecule and/or to alter the spatial 
expression and/or temporal expression of said sense molecule. For example, regulatory 
elements which confer copper inducibility may be placed adjacent to a heterologous 
20 promoter sequence driving expression of a sense molecule, thereby conferring copper 
inducibility on the expression of said molecules. 

Placing a nucleic acid molecule under the regulatory control of a promoter sequence means 
positioning the said molecule such that expression is controlled by the promoter sequence. 

25 Promoters are generally positioned 5' (upstream) to the genes that they control. In the 
construction of heterologous promoter/structural gene combinations, it is generally 
preferred to position the promoter at a distance from the gene transcription start site that is 
approximately the same as the distance between that promoter and the gene it controls in 
its natural setting, i.e. the gene from which the promoter is derived. As is known in the art, 

30 some variation in this distance can be accommodated without loss of promoter function. 
Similarly, the preferred positioning of a regulatory sequence element with respect to a 
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heterologous gene to be placed under its control is defined by the positioning of the 
element in its natural setting, i.e. the genes from which it is derived. Again, as is known in 
the art, some variation in this distance can also occur. 

5 The promoter may regulate the expression of HpF5 or its variant or homolog 
constitutively, or differentially with respect to cell, the tissue or organ in which expression 
occurs or, with respect to the developmental stage at which expression occurs, or in 
response to external stimuli such as physiological stresses, or pathogens, or metal ions, 
amongst others. 

10 

Preferably, the promoter is capable of regulating expression of a nucleic acid molecule in a 
plant cell, tissue or organ, at least during the period of time over which the target gene is 
expressed therein and more preferably also immediately preceding the commencement of 
detectable expression of the HpFSgene in said cell, tissue or organ. 

15 

Accordingly, strong constitutive promoters are particularly useful for the purposes of the 
present invention or promoters which may be induced by virus infection or the 
commencement of HpF5 gene expression. 

20 Plant-operable promoters are particularly preferred for use in the construct of the present 
invention. Examples of suitable promoters include pCaMV 35S (Fang et al, Plant Cell 1: 
141-150, 1989), PGEL1 (Hajdukiewicz et al, Plant Mol Biol 25: 989-994, 1994), class 
in chitinase (Samac and Shah, Plant Cell 3: 1063-1072, 1991), pin2 (Keil et al, EMBO J. 
8: 1323-1330, 1989), PEP carboxylase (Pathirana et al, Plant J. 12: 293-304, 1997; MAP 

25 kinase (Schoenbeck et al, Molec. Plant-Microbe Interact, 1999), MSV (Legavre et al, In: 
Vth International Congress of Plant Molecular Biology, Singapore, 1997), pltp (Hsu et al, 
Plant Set 143: 63-70, 1999), pmpi (Cordero et al, In: General Meeting of the International 
Program on Rice Biotechnology of the Rockefeller Foundation, Malacca, Malaysia, 1997) 
or glutamin synthase (Pujade-Renaud et al, Plant Physiol Biochem. 35: 85-93, 1997). 

30 
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In the present context, the terms "in operable connection with" or "operably under the 
control" or similar shall be taken to indicate that expression of the nucleic acid molecule is 
under the control of the promoter sequence with which it is spatially connected; in a cell, 
tissue, organ or whole plant. 

5 

The construct preferably contains additional regulatory elements for efficient transcription, 
for example, a transcription termination sequence. 

The term 'terminator" refers to a DNA sequence at the end of a transcriptional unit which 
10 signals termination of transcription. Terminators are 3 '-non-translated DNA sequences 
generally containing a polyadenylation signal, which facilitates the addition of 
polyadenylate sequences to the 3 '-end of a primary transcript. Terminators active in plant 
cells are known and described in the literature. They may be isolated from bacteria, fungi, 
viruses, animals and/or plants or synthesized de novo. 

15 

As with promoter sequences, the terminator may be any terminator sequence which is 
operable in the cells, tissues or organs in which it is intended to be used. 

Examples of terminators particularly suitable for use in the synthetic genes of the present 
20 invention include the SV40 polyadenylation signal, the HSV TK polyadenylation signal, 
the CYC1 terminator, ADH terminator, SPA terminator, nopaline synthase (NOS) gene 
terminator of Agrobacterium tumefaciens, the terminator of the cauliflower mosaic virus 
(CaMV) 35S gene, the zein gene terminator from Zea mays, the Rubisco small subunit 
gene (SSU) gene terminator sequences, subclover stunt virus (SCSV) gene sequence 
25 terminators, any Mo-independent E. coli terminator, or the lacZ alpha terminator, amongst 
others. 

In a particularly preferred embodiment, the terminator is the SV40 polyadenylation signal 
or the HSV TK polyadenylation signal which are operable in animal cells, tissues and 
30 organs, octopine synthase (OCS) or nopaline synthase (NOS) terminator active in plant 
cells, tissue or organs, or the lacZ alpha terminator which is active in prokaryotic cells. 
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Those skilled in the art will be aware of additional terminator sequences which may be 
suitable for use in performing the invention. Such sequences may readily be used without 
any undue experimentation. 

5 

Means for introducing (i.e. transfecting or transforming) cells with the constructs are well- 
known to those skilled in the art. 

The constructs described supra are capable of being modified further, for example, by the 
10 inclusion of marker nucleotide sequences encoding a detectable marker enzyme or a 
functional analogue or derivative thereof, to facilitate detection of the synthetic gene in a 
cell, tissue or organ in which it is expressed. According to this embodiment, the marker 
nucleotide sequences will be present in a translatable format and be expressed. 

15 Those skilled in the art will be aware of how to produce the constructs described herein 
and of the requirements for obtaining the expression thereof, when so desired, in a specific 
cell or cell-type under the conditions desired. In particular, it will be known to those skilled 
in the art that the genetic manipulations required to perform the present invention may 
require the propagation of a genetic construct described herein or a derivative thereof in a 

20 prokaryotic cell such as an E. coli cell or a plant cell or an animal cell. 

The constructs of the present invention may be introduced to a suitable cell, tissue or organ 
without modification as linear DNA, optionally contained within a suitable carrier, such as 
a cell, virus particle or liposome, amongst others. To produce a genetic construct, a nucleic 
25 acid (e.g. HpF5) is inserted into a suitable vector or episome molecule, such as a 
bacteriophage vector, viral vector or a plasmid, cosmid or artificial chromosome vector 
which is capable of being maintained and/or replicated and/or expressed in the host cell, 
tissue or organ into which it is subsequently introduced. 
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Accordingly, a further aspect of the invention provides a genetic construct which at least 
comprises a genetic element as herein described and one or more origins of replication 
and/or selectable marker gene sequences. 

5 Usually, an origin of replication or a selectable marker gene suitable for use in bacteria is 
physically-separated from those genetic sequences contained in the genetic construct 
which are intended to be expressed or transferred to a plant cell, or integrated into the 
genome of a plant cell. 

10 As used herein, the term "selectable marker gene" includes any gene which confers a 
phenotype on a cell on which it is expressed to facilitate the identification and/or selection 
of cells which are transfected or transformed with a genetic construct of the invention or a 
derivative thereof. 

15 Suitable selectable marker genes contemplated herein include the ampicillin-resistance 
gene (Amp 1 ), tetracycline-resistance gene (Tc 1 ), bacterial kanamycin-resistance gene 
(Kan 1 ), the zeocin resistance gene (Zeocin is a drug of the bleomycin family which is trade 
mark of InVitrogen Corporation), the AURI-C gene which confers resistance to the 
antibiotic aureobasidin A, phosphinothricin-resistance gene, neomycin phosphotransferase 

20 gen (nptH), hygromycin-resistance gene, (3-glucuronidase (GUS) gene, chloramphenicol 
acetyltransferase (CAT) gene, green fluorescent protein-encoding gene or the luciferase 
gene, amongst others. 

Preferably, the selectable marker gene is the nptll gene or Kan r gene or green fluorescent 
25 protein (GFP)-encoding gene. 

Those skilled in the art will be aware of other selectable marker genes useful in the 
performance of the present invention and the subject invention is not limited by the nature 
of the selectable marker gene. 
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The present invention extends to all genetic constructs essentially as described herein, 
which include further genetic sequences intended for the maintenance and/or replication of 
said genetic construct in prokaryotes or eukaryotes and/or the integration of said genetic 
construct or a part thereof into the genome of a eukaryotic cell or organism. 

5 

Standard methods described supra may be used to introduce the constructs into the cell, 
tissue or organ, for example, liposome-mediated transfection or transformation, 
transformation of cells with attenuated virus particles or bacterial cells, cell mating, 
transformation or transfection procedures known to those skilled in the art. 

10 

Additional means for introducing recombinant DNA into plant tissue or cells include, but 
are not limited to, transformation using CaCb and variations thereof, direct DNA uptake 
into protoplasts, PEG-mediated uptake to protoplasts, microparticle bombardment, 
electroporation, microinjection of DNA, microparticle bombardment of tissue explant or 
15 cells, vacuum-infiltration of tissue with nucleic acid, or in the case of plants, T-DNA- 
mediated transfer from Agrobacterium to the plant tissue. 

For microparticle bombardment of cells, a microparticle is propelled into a cell to produce 
a transformed cell. Any suitable ballistic cell transformation methodology and apparatus 
20 can be used in performing the present invention. Exemplary apparatus and procedures are 
disclosed by Stomp et al (U.S. Patent No, 5,122,466) and Sanford and Wolf (U.S. Patent 
No. 4,945,050). When using ballistic transformation procedures, the genetic construct may 
incorporate a plasmid capable of replicating in the cell to be transformed. 

25 Examples of microparticles suitable for use in such systems include 1 to 5 jim gold 
spheres. The DNA construct may be deposited on the microparticle by any suitable 
technique, such as by precipitation. 

In a further embodiment of the present invention, the genetic constructs described herein 
30 are adapted for integration into the genome of a cell in which it is expressed. Those skilled 
in the art will be aware that, in order to achieve integration of a genetic sequence or genetic 
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construct into the genome of a host cell, certain additional genetic sequences may be 
required. In the case of plants, left and right border sequences from the T-DNA of the 
Agrobacterium tumefaciens Ti plasmid will generally be required. 

5 The present invention further extends to an isolated cell, tissue or organ comprising the 
constructs or parts thereof. The present invention extends further to regenerated tissues, 
organs and whole organisms derived from said cells, tissues and organs and to propagules 
and progeny thereof as well as seeds and other reproductive material. 

10 For example, plants may be regenerated from transformed plant cells or tissues or organs 
on hormone-containing media and the regenerated plants may take a variety of forms, such 
as chimeras of transformed cells and non-transformed cells; clonal transformants (e.g. all 
cells transformed to contain the expression cassette); grafts of transformed and 
untransformed tissue (e.g. a transformed root stock grafted to an untransformed scion in 

15 citrus species). Transformed plants may be propagated by a variety of means, such as by 
clonal propagation or classical breeding techniques. For example, first generation (or Tl) 
transformed plants may be selfed to give homozygous second generation (or T2) 
transformed plants, and the T2 plants further propagated through classical breeding 
techniques. 

20 

Plants contemplated herein include cotton, sweet corn, tomato, tobacco, piniento, potato, 
sunflower, citrus, plums, sorghum, leeks, soybean, alfalfa, beans, pidgeon peas, chick peas, 
artichokes, curcurbits, lettuce, Dianthus (an ornamental plant), geraniums, cape 
gooseberry, maize, flax and linseed, lupins, broad beans, garden peas, peanuts, canola, 
25 snapdragons, cherry, sunflower, pot marigolds, Helichtysum (an ornamental plant), wheat, 
barley, oats, triticale, carrots, onions, orchids, roses and petunias 

Another aspect of the present invention relates to the insensitive chymotrypsin as a 
selectable marker for the transformation of insects. At present, methods for the germ-line 
30 transformation of insects, involves injection of insect embryos with a genetic construct 
comprising a transposable element, the gene of interest and a selectable marker. Somatic 
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transformation of insects can also be achieved using viral vectors that include the gene of 
interest and a selectable marker (Peloquin et aL, J. Cot. ScL 5: 1 14-120, 2001). Presently, 
most insect transformation is done using white-eye mutants of the insect, to allow the 
detection of the commonly used selectable markers. Typically, selectable markers are 
5 genes that complement the white-eye mutation or Enhanced Green Fluorescent Protein 
(EGFP). In Drosophila, the white eye mutation is caused by mutant alleles such as the 
TB un allele. These individuals can be returned to normal (red) eye pigmentation via the 
introduction of an allele conferring normal eye pigmentation such as white (Lidholm et aL, 
Genetics 134: 859-868, 1993) or minwhite (Lozovskaya et aL, Genetics 142: 173-177, 
10 1996). The reversion of tu 1118 mutants to normal eye pigmentation acts as the marker for 
introduction of the vector. In a similar way, EGFP has been used to indicate the presence 
of a vector. Again, insects with non-pigmented eyes are used, and EGFP expression is 
detected in the eyes of these insects (Hediger et aL, Insect MoL Biol. 10: 113-1 19, 2001). 

15 The selectable markers commonly used in the art require dissection of the insects to 
examine the eyes for either pigmentation or EGFP fluorescence, which is time consuming 
and requires destructive sampling of the insects. The present invention provides a means 
for the selection of transformed individuals without the need for insect dissection or 
inspection of individual insects. This would allow the recovery of live transformants and 

20 provides a non-laborious means of screening large numbers of putative transformants at 
one time. In addition, the present invention provides a means for the selection of 
transformants that does not rely on the availability of white-eye mutants. 

The present invention contemplates the use of HpF5 or a derivative, homolog or analog 
25 thereof encoding a NaPI-insensitive chymotrypsin, as a selectable marker in an insect 
transformation vector. This vector, comprising HpF5, would have utility for the selection 
of transformants for any insect that is susceptible to the CI serine proteinase inhibitor of N. 
alata. The contemplated insects may be naturally resistant to CI, or may be NaPI- 
susceptible mutants or genetically modified NaPI susceptible strains of naturally NaPI- 
30 resistant insects. In a particularly preferred embodiment of the invention the insect host of 
the said vector would be Lepidopteran. 
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HpF5 or a derivative, homolog or analog thereof encoding an NaPI-insensitive 
chymotrypsin could be incorporated into any insect transformation vector using common 
molecular biology techniques known to those in the art. Upon transformation the insect 
5 would transcribe HpF5 and produce HpChS, the NaPI-insensitive chymotrypsin. 
Transformed individuals could then be selected by incorporation of the CI proteinase 
inhibitor of N. alata into the diet of the insects. In this case individuals that did not carry 
the insensitive chymotrypsin encoded by HpF5 in the vector would die, and those that did 
carry the vector encoding HpF5 would be insensitive to CI . 

10 

Accordingly the present invention provides insect transformation vectors including 
baculovirus vectors comprising HpF5 or a derivative, homolog or analog thereof, as a 
selectable marker. The vector may be used for any purpose in the insect. Non-limiting 
examples include: gene cloning, gene expression and gene knockouts. Specific examples 

15 of insect transformation vectors into which HpF5 could be incorporated as a selectable 
marker include, but are not limited to: those that utilize the piggyBac mobilizable element 
(Hediger et al, 2001, supra); P-element based vectors (Cripps et al, J. Cell Biol 126: 
689-699, 1994); hobo element based vectors (Lozovskaya et al, 1996, supra); mariner 
element based vectors (Lidholm et al, 1993, supra); and viral vectors such as pTE/3'2J 

20 (Peloquin et al, 200 1 , supra). 

The present invention is further described by the following non-limiting Examples. 
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EXAMPLE 1 

Effect of ingestion of the K alata proteinase inhibitors on 
growth and development of Helicoverpa species 

5 The effect of the N. alata Pis is examined on the digestive enzymes and the growth and 
development of Helicoverpa punctigera and Helicoverpa armigera larvae (Heath et aU J- 
Insect Physiol 43: 833-842, 1997). The Pis suppressed total gut protease activity by 73% 
in an in vitro assay using 14 C-casein as substrate. When incorporated into an artificial diet 
the Pis retarded the growth and development of both H punctigera (Heath et a/., 1997, 

10 supra) (Figure 2 A and 2B) and H. armigera. Similar results are obtained when larvae from 
both species were fed on transgenic tobacco (N. tabacum) expressing the N. alata Pis at 
levels of 0.2% -0.5% soluble protein (Heath et al, 1997, supra). 

Ingestion of NaPIs (CI, C2 and T1-T4) changes the relative activity of the trypsins and 
15 chymotrypsins in the gut and faeces of H punctigera larvae. Trypsin activity in the gut and 
the faeces is substantially lowered or abolished after exposure to NaPIs, whereas 
chymotrypsin activity is often unaffected or enhanced (Figures 2C and 2D). The lack of 
trypsin activity in the faeces of the NaPI fed larvae was not due to decreased production of 
trypsin. Indeed protein blots were used to demonstrate that these insects had overproduced 
20 trypsin in response to the Pis, but it had been totally inactivated (Figure 2E). 

Interestingly, the chymotrypsin produced after ingestion of NaPIs was not inhibitable by 
the NaPIs, whereas some of the chymotrypsin produced by control insects (not exposed to 
NaPIs) was inhibitable (Figure 2F). In control insects the degree of inhibition varied 

25 between individuals, and ranged from about 75% to no inhibition at all This suggested that 
Helicoverpa larvae produce different classes of chymotrypsins; some that are inhibitable 
by the NaPIs (Pi-sensitive) and some that are not inhibitable (Pi-insensitive). Thus, it 
appears ingestion of NaPIs inhibits the trypsin activity in the gut of Helicoverpa larvae, but 
larvae may not be severely impacted if they produce NaPI-insensitive chymotrypsins. In a 

30 subsequent experiment, the gut contents were removed from H punctigera larvae that had 
been raised on a haricot bean diet without added proteinase inhibitors. The gut 
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chymotrypsin activity was inhibited about 80% by the NaPI inhibitors (CI, C2, T1-T4) and 
70% by CI alone (Figure 3), The Bowman Birk and trypsin inhibitors from Soybean and 
the lima bean trypsin inhibitor also failed to abolish all the chymotrysin activity, whereas 
activity was totally inhibited by the potato PotI inhibitor and chymostatin. Using this 
5 information a series of affinity columns were prepared for identification and purification of 
the NaPI-insensitive and -sensitive chymotrypsins from the gut of H. punctigera larvae. 

Bioassays with Helicoverpa larvae on artificial diets 
Haricot bean diet 

10 Helicoverpa punctigera larvae were raised on artificial diets based on Haricot beans 
(Teakle et al. Journal of Invertebrate Pathology 46: 166-173, 1985). One litre of diet was 
composed of 58.5 g Haricot beans, 14 g agar, 700 ml water, 35 g Tortula yeast, 50 g 
wheatgerm, 3,5 g ascorbic acid, 1.1 g sorbic acid, 2.2 g p-hydroxybenzoic acid methyl 
ester, 0.2 g ampicillin, 0.2 g streptomycin, 16 mg prochloraz. The beans were soaked 

1 5 overnight in water, drained and homogenized to a fine paste. Wheatgerm, yeast and 300 ml 
of water were added. The agar was dissolved in 400 mL of boiling water and added to the 
mixture. The mixture was cooled to 50°C before the addition of the remaining ingredients. 
The blended diet was poured into trays and after setting was used immediately or stored at 
-20°C for no longer than two weeks. The test diet was supplemented with the NaPI (0.26% 

20 (w/v)) and the control diet had an equivalent amount of casein. Twenty newly emerged 
neonates were added to each diet and mortality was recorded every two days. Weight gain 
was recorded at the sixth day and then every second day thereafter. The larvae were reared 
in 1.5 ml eppendorf microfuge tubes (one larva/tube) until day eight when they were 
transferred to individual plastic containers with lids (SOLO [trademark] plastic portion 

25 cups, 28 mL). Larvae were fed small amounts of diet (40 mg) initially that was replaced as 
required to provide a continuous supply. The larvae were kept in a temperature controlled 
roomat25±l°C, 16:8 (L:D). 

Cotton leaf diet 

30 Cotton leaf artificial diet was prepared from fresh young leaves from cotton plants (cultivar 
Coker 315) which were grown in an insect-free and insecticide-free temperature controlled 
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cabinet at 26°C (±2°C) with a light regime of 16:8 (L:D). Following picking, the leaves 
were immediately frozen in liquid nitrogen and freeze dried. After drying, the leaves were 
ground to a fine powder in a mortar and pestle. The cotton leaf artificial diet was prepared 
in the same manner as haricot bean artificial diet using a recipe modified from potato leaf 
5 artificial diet (Gatehouse et al, J. Insect Physiol. 45 (<5), 545-558, 1999). One hundred 
grams of cotton leaf artificial diet contained 3 g of cotton leaf powder, 0.08 mL linseed oil, 
2 g yeast, 0.016 mL wheatgerm oil, 2.4 g wheat germ, 0.028 g ampicillin, 3.2 g ascorbic 
acid, 0.028 g streptomycin, 0.08 g sorbic acid, 3.2 g agar, 0.16 g paraben (mould inhibitor) 
plus NaPI or casein to required % ((w/v)). 

10 

Preparation of individual gut and frass extracts 

All work was performed at 4°C. Each gut was dissected from the larva and placed in a 1.5 
mL microfuge tube containing 500 \\L of ice-cold 10 mM Tris-HCl, pH 8. Gut and 
contents were homogenized using a micropestle (Eppendorf). Insoluble material was 
15 removed by centrifugation at 13,000 g for 4 min and the supernatant was stored at -80°C. 
Frass extracts were prepared in the same manner using 200 mg frass/mL buffer. Total 
protein concentration was determined using the Bradford method (Bradford, Anal Biochem 
72: 248-254, 1976) with reagents from Bio-Rad and BSA as a standard. 

20 Trypsin and chymotrypsin activity 

Gut proteinase activity was determined at pH 10 in 50 mM 3-(cyclohexylamino)-l- 
propanesulfonic acid (CAPS buffer) using the chromogenic substrate N-benzoyl-DL- 
arginine-/?-nitroanilide (BApNA) for trypsin and N-succinyl-L-alanine-alanine-proline- 
phenylalanine-p-nitroanilide (SA 2 PFpNA) for chymotrypsin activity. Substrates were 

25 freshly prepared as 1 mM solutions in 10% ((w/v)) N,N-dimethylformamide (DMF), 50 
mM CAPS buffer, pH 10. The assays were performed in duplicate or triplicate following 
the method of Heath et al, EurJ.Biochem.230 (I): 250-257. Blanks, without enzyme, were 
used to account for any spontaneous breakdown of substrates. The release of /?-nitroanilide 
was recorded .at 405 nm after 30 min at 30°C on a SpectraMax 250 microtitre plate reader 

30 (Molecular Devices). 
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Inhibition of trypsin and chymotrypsin activity by NaPI 

Trypsin and chymotrypsin inhibition assays were conducted using the standard trypsin and 
chymotrypsin assays described above except samples were pre-incubated with Tl or CI 
inhibitor (80nM) for 30 min at 30°C prior to the addition of substrate to initiate the 
5 reaction. NaPI monomers Tl and CI were HPLC purified as described by Heath et al, 
1999, supra. 

Chymotrypsin assays in the presence and absence of proteinase inhibitors 
Preparation of gut extracts 

10 Fourth-instar larvae were killed using ethyl-acetate prior the removal of individual gut 
which were then homogenized in a mortar and pestle in an equal amount [(w/v)] of 50 mM 
Tris-HCl, pH 8.0 containing 100 |jM benzamidine. Insoluble material was removed by 
centrifugation at 20,000 rpm for 15 min at 4°C and total protein concentration determined 
using the Bradford method (Bradford, Anal Biochem 72: 248-254, 1976) with reagents 

1 5 from BioRad and BS A as a standard. 

Assays 

Unfractionated gut extract containing approximately 1 jug of buffer soluble protein was 
added to 10 jiL CAPS buffer (0.5 M, 3-[cyclohexylamino]-l-propane-sulfonic acid, pH 10) 

20 and made to a final volume of 100 [iL in individual wells of a 96 well microtiter plate. 
Proteinase inhibitors were added over a range of concentrations and incubated for 15 min 
at 25°C. Bovine chymotrypsin (100 ng) was used as a positive control, both for activity 
and inhibition. The chromogenic artificial substrate SAAPFpNA (N-Succinyl-L-alanyl-L- 
alanyl-L-prolyl-L-phenylalanine 4-nitroanilide) was then added to a final concentration of 

25 1 mM and hydrolysis of the substrate was measured at 405 nm using the SpectraMax 250 
microtiter plate reader (Molecular Devices). 

Proteinase inhibitors 

Lima bean trypsin inhibitor (LBTI), soybean trypsin inhibitor (SBTI), soybean Bowman- 
30 Birk inhibitor (SBBI) and chymostatin were all purchased from Sigma-Aldrich Pty.Ltd. 
Potato Inhibitor I was purchased from Calbiochem-Novabiochem or purified from potato 
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tubers (Example 6). A erode mixture of Potato inhibitor I and II was also obtained from 
potatoes, N. alata proteinase inhibitors (NaPIs) were purified as described by Atkinson et 
al.y 1993, supra and Heath et al, 1995, supra. The chymotrypsin inhibitor CI was purified 
from bacterial expression cultures. The purity of the inhibitors was assessed by SDS- 
5 PAGE and silver staining. 

EXAMPLE 2 

Isolation of N API sensitive and insensitive chymotrypsins from H. punctigera gut 

10 Most published work on Helicoverpa chymotrypsins has focused on cDNA clones or the 
measurement of enzyme activity in unfractionated gut extracts. There are few reports on 
purification of chymotrypsins from Helicoverpa or other lepidopteran species. Johnston 
and coworkers (1995, supra) described partial purification of chymotrypsins from 
Karmigera that employed ion exchange techniques. Peterson and coworkers (Insect t 

15 Biochem. Mol Biol 25: 165-11 A, 1995) purified a chymotrysin from the midgut of the 
lepidopteran Manduca sexta by affinity chromatography on tryptophan methyl ester and 
Valiatis et al, Insect Biochemistry and Molecular Biology 29; 405-415, 1999 used 
immobilized potato proteinase inhibitor I (PotI) to isolate a chymotrypsin from the 
Western Spruce budworm. No one has described a procedure that separates individual 

20 chymotrypsin isozymes from one another and there is no description of the isolation of two 
chymotrypsins from a single species where one isozyme is inhibitable by a certain 
proteinase inhibitor while another is not inhibitable. 

Preparation of pure enzymes and N-terminal sequencing 

25 The midgut was dissected from 80 fourth instar larvae and buffer soluble extracts were 
prepared. The gut extract was depleted of trypsins by repeated passage through a 
benzamidine-Sepharose affinity column. The unbound protein was collected and applied to 
an affinity column composed of the immobilized chymotrypsin inhibitor CI (Figure 1) that 
had been produced using a bacterial expression system. This column was expected to 

30 specifically bind NaPI-sensitive chymotrysins. Proteins that did not bind to this column 
were applied to a third affinity column composed of either immobilized Potato Type I 
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(Potl) and Type II inhibitors (Potll) or chymostatin. This column was designed to capture 
the chymotrypsins that did not bind to the CI column, that is, the NaPI-insensitive 
chymotrypsins. Proteins that bound to the affinity columns were eluted with 8 M urea and 
were subjected to electrophoresis through an SDS-polyacrylamide gel before transfer to a 
5 PVDF membrane for N-terminal sequencing (Figures 4 and 5). 

About 30 amino acids of N-terminal sequence were obtained that confirmed that the 
proteins were indeed, chymotrypsins, and that the sensitive and insensitive chymotrypsins 
were products* of different genes (Figures 4 and 5). 

10 

Preparation of NaPI-insensitive chymotrypsin depleted of trypsins and NaPI sensitive 
chymotrypsins for biochemical analysis 

The midgut was dissected from 100 fourth instar larvae and buffer soluble extracts were 
prepared. The gut extract was depleted of NaPI-sensitive proteases by passage through an 

1 5 affinity column composed of immobilized NaPI protein (C 1 , C2, Tl -T4). All trypsins and 
NaPI-sensitive chymotrypsins bound to the column and the NaPI-insensitive chymotrypsin 
was unbound. This preparation of unbound material was used to study the pH optimum, 
substrate preference and effect of a range of proteinase inhibitors on the activity of the 
NaPI-insensitive proteinase. The effect of pH on activity of the insensitive chymotrypsins 

20 is illustrated in Figure 6. The enzyme is inactive below pH6 and is most active at pH10-12 
consistent with its role in the alkaline midgut of larvae. The best substrate for enzyme 
assays was determined using seven different commercial substrates. The best substrate was 
N-succinyl-Ala-Ala-Pro-Phe-p-nitroanilide (sAAPF-pNA) followed by N-succinyl-Ala- 
Ala-Pro-Leu-p-nitroanilide (sAAPL-pNA) and N-methoxysuccinyl-Ala-Ala-Pro-Met-p- 

25 nitroanilide (mAAPM-pNA). N-succinyl-Ala-Ala-Ala-p-nitroanilide (sAAA-pNA), 
benzoyl-tyr- p-NA, Ac-Pro-Leu-Ser-p-NA and Ac-Asn-Gly-Ile-Pro-p-NA were not 
substrates. 

Several proteinase inhibitors were tested for their ability to inhibit the activity of the 
30 insensitive chymotrypsins. The buffer used for all assays was 500 mM CAPS, 75 mM 
NaCl, 2.5 mM MgCk at pH 10. The inhibitors were preincubated with 100 ng of bovine 
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chymotrypsin or the amount of the gut chymotrypsin required to produce the same 
absorbance as 100 ng of bovine chymotrypsin (note that the Helicoverpa enzyme has been 
depleted of trypsins and sensitive chymotrypsins, but still contains other gut proteins) for 
30 min at 30°C before the addition of substrate. The incubation was continued for a further 
5 30 min at room temperature before absorbance was measured at 405 nm. PotI was the best 
of the proteinaceous inhibitors and NaPI did not inhibit (Table 3). 

Experimental protocols 

1 0 Benzamidine Column 

Benzamidine-agarose (1 mL; was purchased from ICN Biomedicals) and contained 35 
umoles benzamidine per ml of gel. 

Production of the Cl~affinity column 

15 DNA encoding the CI domain of NaPI (Figure 1, Atkinson et al, 1993, supra) was 
amplified from the pNa-PI-2-cDNA (Atkinson et al, 1993, supra) using oligonucleotide 
primers that incorporated BamUl (5' GACCAGCCGGATCCGATCGGATAT 
GCACCAAC) [SEQ ID NO:7] and HindOl (3' GGAGCCAAGCCAAGCTTTGAACGCG 
GGCAAACTC) [SEQ ID NO:8] sites for cloning into a pQE expression vector (Qiagen). 

20 The PCR product was sub-cloned into the pCR (registered trademark) 2.1-TOPO vector 
(Invitrogen) then excised with BamUl and HindlU and ligated into the pQE-30 vector. The 
expression vector incorporated a hexahistidine tag at the N-terminus of the expressed 
protein for metal affinity purification. The Cl/pQE-30 construct was transformed into the 
chemically competent E. coli strain Ml 5 (Qiagen) prepared according to the method of 

25 Inoue et al t Gene 96: 23-28, 1990. Bacterial expression cultures were grown and induced 
according to the procedures outlined in the QiaExpress manual (Qiagen). Expression of the 
6H.C1 recombinant protein was achieved by induction with 1 mM of IPTG. Samples (1 
ml) were removed from the culture at hourly intervals, collected by centrifugation and 
resuspended in IX SDS loading buffer (100 \xL) for SDS-PAGE analysis. 

30 

Recombinant CI inhibitor was purified under denaturing conditions using Talon Metal 
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Affinity Resin (2 mL) (BD Biosciences Clontech) according to the manufacturer's 
protocol CI was eluted from the affinity matrix and examined using SDS-PAGE to 
confirm purity. Subsequent rounds of expression were conducted under the same 
conditions and CI from a total of 2L of culture was purified using the Talon resin. After 
5 washing to remove unbound proteins the CI inhibitor remained bound to the Talon resin 
which was then used as the affinity column to purify chymotrypsins from gut preparations. 

Potato Inhibitor I and potato Inhibitor II affinity column 

10 Cyanogen bromide-activated Sepharose 4B (1 g) was swollen and washed according to the 
manufacturer's protocol. A mixture of potato I and II Inhibitors (10 mg) was dissolved in 
coupling buffer [0.1 M NaHC0 3 , 0.5 M NaCl, pH 8.3], combined with the gel suspension 
and incubated over night at 4°C in an end-over-end mixer. The gel was rinsed several 
times in blocking buffer [0.1 M Tris-HCl, pH 8.0] to remove excess ligand. Following the 

15 washes the conjugated Sepharose was mixed with fresh blocking buffer and incubated 
overnight at 4°C. The gel slurry was transferred to a column (Amersham Biosciences) and 
washed alternately (x3) with five column volumes of coupling buffer then five column 
volumes of rinse buffer [0.1 M NaOAc, 0.5 M NaCl, pH 4.0]. Finally the matrix was 
extensively washed with coupling buffer and stored in 20% (v/v) ethanol at 4°C. 

20 

Chymostatin affinity column 

Chymostatin (Sigma-Aldrich) was immobilized on EAH Sepharose-4B (1.5 ml; Sigma- 
Aldrich) according to the manufacturer's instructions. Chymostatin (10 mg) was dissolved 
in 500 \xL of glacial acetic acid then 1 mL of distilled water was added and the pH adjusted 
25 to 4.5 with dilute NaOH. The gel suspension and chymostatin solution were combined 
before EDC [N-ethyl-N-(3-dimethylaminopropyl) carbodiimide] was slowly added to a 
final concentration of 0.1 M. The gel suspension was mixed overnight at 4°C before the gel 
was washed with Milli-Q filtered water to remove excess ligand and unreacted 
carbodiimide and the gel matrix was stored at 4°C in 20% [(v/v)] ethanol. 
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Purification of NaPI- sensitive and insensitive chymotrypsins 

Eighty gut from fifth instar larvae that had been stored at -80°C were homogenized using a 
mortar and pestle in 10 ml of gut extraction buffer (20 mM CAPS pH 10, 350 mM NaCl). 
The gut extract was centrifuged at 15,000 g for 15 min at 4°C and the supernatant was 
5 filtered through a syringe filter (0.45 ^M; Millipore) then briefly stored on ice before 
application to the benzamidine affinity column. 

The filtered gut extract was passed through the benzamidine column five times to remove 
the trypsins. The unbound fraction (10 mL) was passed through the CI affinity column 
10 (x3) before the column was washed with 20mL of extraction buffer and bound proteins 
were eluted with 8 M urea, pH 8.0 (5 mL). Proteins that did not bind to the CI column 
were applied to the Potato Inhibitor I and II affinity column prior to washing with 10 
column volumes of buffer [20 mM CAPS pH 10, 0.5 M NaCl] and elution of bound 
proteins with 8 M urea, pH 8.0 (5 mL). 

15 

Analysis of affinity purified gut proteins: SDS-PAGE 

Samples of gut proteins (5-10 jag) were concentrated using TCA precipitation and 
resuspended in 0.2 M NaOH (10 ^L). SDS-PAGE sample buffer (50 mM Tris-HCl, pH 
6.8, 2% (w/v) SDS, 10% (v/v) glycerol, 5% (v/v) p-mercaptoethanol, 0.1% (w/v)] 

20 bromophenol blue) was added and samples were heated to 100°C for 5min prior to 
separation on 12.5% (w/v) reducing polyacrylamide gels using the MiniProtean II 
Electrophoresis apparatus (Bio-Rad) at 200 Volts. Broad range or peptide molecular size 
markers (Bio-Rad) were used to estimate relative molecular masses. Following 
electrophoresis the proteins were visualized by staining with Coomassie Brilliant Blue R- 

25 250 (0.1% (w/v) in 40% (v/v) methanol, 10% (v/v) acetic acid) for 60 min followed by 
destaining in 40% (v/v) methanol, 10% (v/v) acetic acid or were transferred to either 
nitrocellulose (0.22 \M pore size; Micron Separations Inc.) or Sequi-Blot Polyvinylidene 
Fluoride membrane (PVDF; Bio-Rad). 
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Immunoblotting and N-terminal sequencing 

After electrophoresis gels were equilibrated in transfer buffer (192 mM glycine, 48 mM 
Tris-base, 20% (v/v) methanol) for 10 min prior to the transfer of proteins to a 
nitrocellulose membrane (0.22 uM pore size; Micron Separations Inc.) using the Mini 

5 Trans-Blot apparatus (BioRad) at 100 V for 60 min. Membranes were briefly washed in 
TBS (20 mM Tris-HCl, 150 mM NaCl, pH 7.5) then stained with amido black (1:50 
dilution of 0.1% (w/v) amido black, 40% (v/v) methanol, 10% (v/v) acetic acid) to confirm 
transfer of the proteins and to visualize the molecular size markers. Blots were then 
blocked by incubation with 3% (w/v) skim milk powder (Dutchjug) in TBST [0.1% (w/v) 

10 Tween-20 in TBS] for 1 h at RT, followed by a 1 h incubation with the cc-chymotrypsin 
antibody (HpCH2B; 1:5000 dilution in 3% (w/v) skim milk powder in TBST). The 
nitrocellulose blots were then rinsed three times (5 min) in TBST and incubated for 1 hour 
at RT with the secondary antibody (anti-rabbit IgG conjugated to horseradish peroxidase 
diluted 1:5000 in TBST; Amersham Biosciences). Membranes were washed three times (5 

15 min) with TBST and immunoreactive proteins were visualized with Enhanced 
Chemiluminescence (ECL) reagents and Hyperfilm ECL X-ray film (Amersham 
Biosciences) according to the manufacturer's instructions. 

To obtain N-terminal sequence, samples were transferred to Sequi-Blot PVDF membrane 
20 (BioRad) equilibrated in electroblotting buffer (10 mM CAPS, 10% (v/v) methanol, 0.01% 
(w/v) SDS, pH 11) using the Mini Trans-Blot cell (BioRad) at 100 V for 45 minutes. 
Following transfer, the membrane was briefly rinsed in Milli-Q water, stained with 
Coomassie Brilliant Blue (0.1% Coomassie Blue R-250, 1% (v/v) acetic acid, 40% (v/v) 
methanol) and then destained (50% (v/v) methanol). Finally the membrane was rinsed with 
25 water and dried before excision of the appropriate proteins for N- terminal sequencing. 
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EXAMPLE3 

Cloning cBNAs encoding gut chymotrypsins from H. punctigera 

cDNAs encoding the H. punctigera chymotrypsins were obtained using two approaches. 
5 Both approaches employed PCR amplification of cDNA produced from midgut mRNA 
extracted from K punctigera larvae at the late fourth and early fifth instar stage of 
development. 

Isolation of chymotrypsin clones using oligonucleotides complimentary to highly 

1 0 conserved regions in H. armigera chymotrypsins 

Bown and colleagues (1997, supra) have described several cDNA clones encoding 
K armigera chymotrypsins. Figure 7 shows the predicted proteins and the regions 
complementary to the oligonucleotides (Figure 8) chosen for PCR amplification of 
chymotrypsin cDNAs from H. punctigera. The PCR products were cloned and sequenced, 

15 and five distinct chymotrypsin sequences were obtained (FlApcr, FIBpcr, F2Bpcr, F3pcr 
and F4pcr, Figure 9). These PCR products were used to screen a cDNA library prepared 
from midgut mRNA isolated from late fourth instar and early fifth instar larvae. Seven 
distinct cDNA clones were isolated encoding chymotrypsins that were divided into four 
families based on sequence identity (Figure 10, Table 2). These chymotrypsins share high 

20 sequence identity with the H. armigera chymotrysins described by Bown et a/., 1997, 
supra and with the small number of chymotrypsin sequences reported for H. virescens and 
H. zea (Table 6). The chymotrypsins encoded by the full length clones presented in Figure 
10 characteristically encode zymogens of approximately 292-295 amino acids, including 
putative ammo-terminal signal peptides of 16-17 residues predicted by the signal peptide 

25 prediction program PSORTII. The presence of the residues IVGG (positions 62-65) at the 
N-terminus of several active chymotrypsins results in the prediction of activation peptides 
ranging from 35-44 residues in length on the zymogens predicted from the cDNA clones. 
Furthermore these activation peptides consistently have the dipeptide arginine-isoleucine at 
their C-termini suggesting a role for trypsin in activating the chymotrypsins. The sequence 

30 identities for the mature domains of each translated protein are presented in Table 2. 
Family 4 was most divergent with less than 20% identity with the other families, while 
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families 2 and 3 shared most similarity with scores of about 85%. The N-terminal sequence 
obtained from the NaPI inhibitable chymotrypsin (Figure 4) matched the N-terminal 
sequence predicted from the cDNA clones encoding chymotrypsins from family 2. The N- 
terminal sequence of the NaPI -insensitive chymotrypsins was not represented in the 4 
5 families of chymotrypsins represented by the cDNA clones. 

TABLE 5 

Protein sequence identity (%) betweeen members of the H. punctigera 
chymotrypsin gene family (mature chymotrypsin domain only) 
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TABLE6 

H. punctigera chymotrypsins are closely related to chymotrypsins 
from oilier Helicoverpa species 
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Protein sequences most similar to H. punctigera were obtained using the BLAST search 
engine. Genbank accession numbers are listed. The mature activated proteins were 
10 compared and percentage of protein sequence identity determined. 

The N-terminal region of the insensitive chymotrypsins had two unique stretches of 
sequence, designated Fl and F2 that were used to design oligonucleotides for PCR 
amplification of DNA encoding one of the insensitive chymotrypsins (Figure 1 1) A 641 bp 

15 fragment of DNA was obtained with the Fl oligonucleotide that encompassed most of the 
protein-coding region. This PCR product was used to screen for a full length clone in the 
H. punctigera midgut library. Approximately 0.2% of the 50,000 plaques screened 
hybridized strongly to the PCR product. Ten plaques were selected and and a full length 
clone was identified and sequenced (Figure 12). The 921 bp clone had an open reading 

20 frame of 828 bp and residues 41-76 of the deduced protein were an identical match to the 
N-terminal sequence obtained from the purified protein (Figure 12). The clone lacked the 
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5'signal peptide sequence, but comparison to the other chymotrypsin clones indicated the 
activation peptide was full length. The active enzyme is predicted to be 236aa in length, 
with a mass of 24.2 kDa. Protein sequence homology was determined by comparing the 
mature domains of each of the K punctigera cDNAs. Family 2 and family 3 were most 

5 similar to HpChS (Rechla), with about 72 and 70% identity respectively. Family 1 
members shared 53-57% identity while Family 4 had about 20% identity with HpCh5. 
Comprehensive Blast searches at the NCBI facility of both translated and protein databases 
produced no significant matches. The insensitive chymotrypsin belongs to a new family of 
Helicoverpa chymotrypsins designated Family 5 (Figure 13). A blast search performed on 

10 a Helicoverpa armigera midgut EST database matched a clone with over 97% sequence 
identity and 100% sequence similarity Figure 14. 

Bovine chymotrypsins. A and B and the NaPI-sensitive chymotrypsin (HpCh2A) were 
compared to the insensitive chymotrypsin (HpChS) to identify regions of variability that 
15 may be involved in low affinity binding to CI (Figure 1 5). 

Ten substitutions identified in the alignment did not appear to fall into functionally 
significant regions, whereas the eleventh substitution was associated with one of the p- 
strands that forms a wall of the primary substrate-binding pocket. The location of this 
20 substitution and conversion to an arginine is highly unusual for the SI domain that is 
predominantly lined with non-polar residues that define chymotrypsin specificity. 

Experimental protocols 

25 Preparation of RNA 

Fifty late fourth and early fifth instar larvae were cold anaesthetized before the gut was 
removed and snap frozen in liquid nitrogen. The 50 gut were ground to a fine powder in 
the presence of liquid N2 using a mortar and pestle and were stored at -70°C. The gut tissue 
(100 mg) was added to 1 mL of TRIZOL (trademark) reagent (Life Technologies) and total 

30 RNA was purified and quantified according to the manufacturer's protocol. 
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RT-PCR amplification of chymotrypsin genes 

Chymotrypsin genes from Helicoverpa armigera were aligned using ClustalW and regions 
of high conservation were identified for design of chymotrypsin specific primers. Primers 
were individually designed to the K armigera clone (CAA72951) due to the high level of 

5 divergence to the other chymotrypsin sequences. Forward primers were used in 
combination with RVG4 to amplify the gene fragments using reverse transcriptase 
polymerase chain reaction (RT-PCR; Superscript Preamplification System, Gibco BRL) 
with the protocols supplied. The PCR products were gel purified (Qiagen gel extraction 
kit), sub-cloned into TOPO PCR2.1 TA cloning vector (Invitrogen) and transformed into 

10 chemically competent E. coli XL 1 -Blue cells (Stratagene) prepared according to the 
method of (Inoue et aL, 1990, supra). Plasmid DNA was prepared using the QIAprep 
(registered trademark) Spin Miniprep Kit (Qiagen). 

Production and screening of a H. punctigera gut cDNA library: 

1 5 Library preparation 

PolyA+ mRNA was isolated from the RNA (1.0 mg) using conventional protocols. A 
biotinylated oligonucleotide (dT) primer that binds to the polyA tail tags the transcripts 
that are subsequently captured on streptavidin paramagnetic particles (PolyATract 
(registered trademark), mRNA Isolation Systems, Promega). Purified mRNA (5 jag) was 

20 used to construct a cDNA library with the Lambda ZAP-cDNA synthesis kit and the Zap- 
cDNA Gigapacklll Gold packaging extract (Stratagene) following the manufacturer's 
instructions. The amplified library titre was 2.8 x 10 10 pfu/mL. 

Labeling probes 

25 RT-PCR products (50 ng) were individually labelled with [a- 32 P] dCTP (Amersham Life 
Sciences) using the MEGAPRIMB (trademark) DNA labeling system labeling (Amersham 
Life Sciences). Unincorporated radiolabeled nucleotides were removed using the Micro 
Bio-spin P-30 chromatography columns (BioRad) according to the manufacturer's 
protocol. The double stranded labelled probes were denatured by boiling (5 min), cooled 

30 on ice, and then added to the hybridization solution (as below). 
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Screening the cDNA library 

The primary screen was performed using ten (15 cm) Petri dishes per probe, with about 
50,000 phage per plate. Preparation of the plates for subsequent plaque lifts and treatment 
of the membranes prior to hybridization was performed according to instructions provided 

5 with the Lambda Zap cDNA synthesis kit (Stratagene). The membranes (Hybond-N; 
Amersham Biosciences) were prehybridized in 50 ml of 2 x PIPES (0.8 M NaCl, 16 mM 
Piperazine-l,4-bis(2-ethanesulphonic acid) pH 6.5) 50% (v/v) formamide, 0.5% (w/v) SDS 
and 100 jig/mL Herring sperm DNA (Boehringer Manningham) at 42°C for 3 hours. 
Following prehyridization, the incubation solution was replaced with 50 ml of 50% (v/v) 

10 formamide, 2 x PIPES buffer, 0.5% (w/v) SDS and 100 jxg/mL denatured herring sperm 
DNA containing the labeled probe and left overnight at 42°C. The hybridization 
membranes were washed three times in 2 X SSPE/0.1% (w/v) SDS at room temperature 
and twice in 0.2 X SSPE/0.1% (w/v) SDS at room temperature, before they were blotted 
on 3 mm Whatman paper and exposed to X-ray film (Kodak XAR-5) for 48 hours at -70°C 

1 5 with intensifying screens. 

Due to crowding and hence overlapping between positive clones and non-specific plaques 
a secondary screen was conducted to isolate individual clones. Two 8.5 cm plates each 
with 50 plaques were used for each probe. The secondary screen was conducted as 
20 described for the primary screen. 

At least 50 positive plaques of varying intensities were selected from each screen probed 
with an individual RT-PCR product. Each plaque was transferred to a 1.5 mL microfuge 
tube containing 1 mL of SM buffer (0.1 M NaCl, 8 mM MgS0 4 .7H 2 0, 50 mM Tris-HCl, 
25 pH 7.5, 0.01% (w/v) gelatin) and chloroform (20 |oL) and stored at 4°C. Initially 10 
plaques for each probe were excised and converted into pBluescriptll SK(-) phagemids 
using the ExAssist/SOLR system (Stratagene). The excised phagemids were transformed 
into the chemically competent E. coli strain XL 1 -Blue cells and plasmid DNA was 
prepared and sequenced. 



WO 2004/094630 



PCT/AU2004/000524 



-84- 

DNA sequencing and analysis 

The cDNA clones were grouped on the basis of restriction fragment patterns obtained 
using combinations of the endonucleases BowHI, XhoU., Kpnl, Sad, SacTL, and Sail 
(Promega). RT-PCR products and cDNA inserts were sequenced in both directions using 

5 Ml 3 universal primers at either Micromon sequencing facility at Monash University 
(Melbourne) or SUPAMAC at the Royal Prince Alfred Hospital in Sydney. The sequence 
data was edited using the BioEdit v5 .0.9.1 software written by Tom Hall, North Carolina 
State University freely available at the web address: www.mbio.ncsu.edu/BioEdit/ 
bioedithtml. Sequence homologies were assessed using the BLASTN search facility at 

10 National Centre for Biotechnology Information (NCBI) and further multiple sequence 
alignments were performed using ClustalW version 1.4. at the Network Protein Sequence 
Analysis facility (NPSA; http://npsa-pbil.ibcp.fr/cgi-bin/align_clustalw.pl ) (Combet et al, 
TIBS. 25: 147-150, 2000). 

15 The web based program 'PSORT II' available at the Human Genome Centre at the 
University of Tokyo (http://psort.nibb.ac.jp/form2.html), was used to predict signal peptide 
cleavage points. UTRscan was used to detect functional elements in the 3' untranslated 
regions of the cDNA clones [Pesole, Trends Genet, 15: 378, 1999]. 
(http://bighost.area.ba.cnr.it/BIG/UTRScan/). 

20 

Isolation of a partial cDNA clone encoding a NaPI- insensitive chymotrypsin 
A sample of RNA previously purified for the production of the cDNA library was thawed 
and used as template for the following RT-PCR reaction. Two 5' degenerate primers (5) 
were designed to unique regions of the N-terminal amino acid sequence and used in 

25 combination with the 3' primer RVG4. First strand cDNA synthesis was achieved using 
the SuperScriptll Preamplification system from Stratagene and was followed by PCR 
amplification of the target cDNA using the Perkin Elmer thermocycler [25 cycles for 1 min 
at 94°C, 1 min at 48°C and 1 min at 72°C then 7 mins at 72°C]. PCR products were 
separated on 1% (w/v) agarose gel (SEAKEM (registered trademark); BioWhittaker 

30 Molecular Applications) and a band of approximately 650 bp was excised and purified 
using the Concert purification system (Gibco). The partial cDNA was cloned into the 
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TOPO PCR2.1-TA vector (Invitrogen) and transformed into E. coli strain XL-BL1 
(Stratagene). 

Isolation of the insensitive chymotrypsin cDNA clone 
5 The cDNA library prepared from fourth instar larval gut was screened using a partial 
fragment cloned according to the techniques described above 

EXAMPLE 4 
Homology modeling of the H. punctigera chymotrypsins 

10 

The deduced amino acid sequences from the cDNA clones HpF2B (sensitive) and HpF5 
(insensitive) were modeled on the structures of the Bos taurus (bovine) and fire ant 
chymotrypsins, obtained from the Research Collaboratory for Structural Bioinformatics 
(RCSB) Protein Data Bank site (http://www.rcsb.org/pdb/). The Helicoverpa 

15 chymotrypsins are predicted to adopt similar structures to those reported for all the 
chymotrypsin structures available in the PDB databank. The modeled structure consists of 
the classic serine protease fold consisting of two, six-stranded anti-parallel p barrels with 
the catalytic triad located between the two domains. Two surface loops, 60 and 142 are 
considerably larger in the H punctigera chymotrypsins (Figures 15 and 16). Due to the 

20 limitations of modelling, a small amount of ambiguity was present in several surface loops, 
some of which are cleaved in mammalian chymotrypsins (loop 142), but remain intact 
within insect chymotrypsins. The only reported crystal structure of an insect chymotrypsin 
is from the fire ant, Soenopsis invicta (Botos et al, Journal of Molecular Biology 298: 
895-901, 2000) and this was used to help refine the orientation of the surface loops on the 

25 model of the Helicoverpa chymotrypsin, 

CI was modeled in complex with sensitive and insensitive chymotrypsins to investigate 
whether substitution of glutamine (or asparagine) 192 (Greer nomenclature, Figure 15) 
with an arginine would affect the binding capacity of the Helicovopa chymotrypsins. The 
30 structure of the chymotrypsin inhibitor (CI) was previously determined by 1H NMR 
(Nielson et al, 1994, supra). No structures of CI complexes have been solved and 
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therefore the related proteinase inhibitor PCI-1 from Solarium tuberosum in complex with 
Proteinase B from Streptomyces griseus (Greenblatt et al, 1989, supra) provided a basis 
for structural modeling. Figure 17 illustrates the binding region surrounding Gin 192 in the 
Cl-HpF2B chymotrypsin complex. The predicted model of HpF2B and CI shows that 
5 glutamine 192 is not in conflict with any regions on the inhibitor molecule. Comparison to 
the cognate arginine residue in the insensitive chymotrypsin however suggests there is 
limited space to accommodate this much larger residue (Figure 18). The modeling viewing 
program 'Spdbv', predicted Arg 192 to be in direct conflict with threonine 5 of CI. PotI 
has been identified as a strong inhibitor of the insensitive chymotrypsin from H. 
10 punctigera and was, therefore, modelled in complex with the insensitive chymotrypsin 
(Figure 19). Unlike CI, PotI easily accommodates arginine 192 upon binding to the 
insensitive chymotrypsin. 

In summary, the NaPI-insensitive chymotrypsin from Helicoverpa species has an arginine 
15 in place of an asparagine or glutamine at position 192 that extends into the SI binding 
pocket and appears to interfere with CI binding. Furthermore, it is clear that this arginine 
residue does not interfere with PotI binding, consistent with the observation that PotI is a 
much more efficient inhibitor of insect chymotrypsins than the NaPI inhibitors. 

20 EXAMPLE 5 

Production and characterization of polyclonal antisera to the NaPI inhibitable 
and NaPI- insensitive chymotrypsins from H. punctigera 

Chymotrypsin clone HpF2B was expressed in E. coli fused to a six histidine (6.H) tag at 
25 the C-terminus and was purified to homogeneity on Talon metal affinity resin (Figure 20) 
for injection into a rabbit for production of polyclonal antibodies. N-terminal sequencing 
of the purified product confirmed the expression of the NaPI inhibitable chymotrypsin 
(HpCh2B). After the fourth boost with antigen, the serum was collected and tested on 
protein blots of bacterially expressed protein and unfractionated gut extracts. The antibody 
30 detected the full-length recombinant chymotrypsin at a dilution of 1 in 2500 as well as 
several break-down products. Unfractionated gut extract and a sample of protein bound to 
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the CI affinity column were also stained with the anti-HpCh2B antibody which detected 
the mature native form of the enzyme. 

Purified 6H.HpCh2B was used to test the detection limit of anti-HpCh2B antibody by 
5 comparison of immunoblots to silver stained SDS-PAGE gels. The antibody detected 20 
ng of bacterially expressed chymotrypsinogen and also recognized the mature form of the 
native chymotrypsin isolated from gut ofH. punctigera. 

The cDNA (HpF5) encoding the NaPI-insensitive chymotrypsin (HpCh5) was expressed in 
10 E. coli in a similar manner except the six.histidine tag was fused to the N-terminus of the 
expressed protein. The polyclonal antiserum that was raised against the bacterially 
expressed NaPI inhibitable chymotrypsin (HpCh2B) did not cross-react with bacterially 
expressed NaPI-insensitive chymotrypsin (HpCh5) on protein blots (Figure 21). Likewise 
the antiserum raised against HpChS did not bind to HpCh2B. This indicates that these 
15 antisera can be used to specifically distinguish between and monitor levels of the NaPI- 
insensitive and sensitive chymotrypsins in unfractionated gut extracts. 

Experimental protocols 

20 Preparation of antigen for immunization 

The cDNA clone 'HpF2B'which encodes an NaPI sensitive chymotrypsin (Figure 1 1) was 
amplified by polymerase chain reaction (PCR) using two oligonucleotides (Table 7) that 
incorporated Ncol and BgUl restriction sites at the 5' and 3' ends of the cDNA 
respectively. 

25 



WO 2004/094630 



PCT/AU2004/000524 



-88- 
TABLE 7 

Degenerate primers designed to two unique regions in the N-terminus of the 
insensitive chymotrypsin protein. Primer positions are shown in the amino acid 

sequence by matching typeface 



5 



N-terminal sequence of NaPI-insensitive chymotrypsin 
rVGGSLSSVGQIPYQAGLFZDZ^GGQAVCGGSLISA [SEQ ID NO:9] 


Primer. Name* 




Fw2ResChy 


TC(AGCT) GT(AGCT) GG(AGCT) CA(AG) AT(ACT) CC [SEQ ID 
NO: 10] 


FwResChym 


GT(AGCT) AT(ACT) GA(CT) CT(AGCT) GC(AGCT) GG(AGCT) 
GGTSEOIDNO:!!] 



TABLE 8 

PCR amplification primers for bacterial expression of chymotrypsin HpF2B 



Prime'rNam'e',' : ';rt 


Restriction 


" Y# ; - y> Sequence . • . .• ji-v • 
:V- , :-r' : ;*f ' ~ »••*, :3w*tatf r • ■ "• "•••v; •• •" ■'•;•{ 


5' Hc35PQE-60Fw 


Ncol 


TTA ACC ATG GTG ATC GAC CTC [SEQ ID 
NO: 12] 


Hc35PQE-60Rv 


Bgm 


GAT GAG ATC TGA GAC GTT GGT TG [SEQ ID 
NO: 13] 



10 

The amplified region consisted of the pro-peptide and mature domain of the 
chymotrypsinogen, but lacked the putative secretion signal. Digests using Ncol and BgRL 
enzymes (Promega) were performed on the PCR amplified product and the pQE-60 
expression vector (Qiagen). The pQE-60 vector provides a His-tag at the C-terminus of the 

15 expressed protein. Each restriction digest was purified using WIZARD (registered 
trademark) DNA clean up system (Promega) and the vector and chymotrypsin insert were 
subsequently ligated using standard molecular biology techniques (Sambrook and Russell, 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 3rd 
edition 2001) The ligation mix was heated at 65°C for 10 min before being transformed 

20 into the E. coli strain XLl-Blue. Plasmid DNA was prepared for sequencing and 
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subsequent transformation into the E. coli cell line M15 (Qiagen) using the QIAPREP 
(registered trademark) Spin Miniprep Kit (Qiagen). 

Expression and purification of the recombinant chymotrypsinogen was performed under 
5 denaturing conditions according to the methods detailed in the QiaExpress manual 
(Qiagen). The purity of the expressed chymotrypsinogen was assessed by SDS-PAGE and 
the identity of the recombinant protein confirmed by N-terminal sequencing. Preparation 
of the chymotrypsin for injection consisted of removal of the urea by dialysis against 50 
mM Tris-HCl pH 8.0. During this process most of the protein aggregated. The aggregated 
10 protein was collected by centrifiigation and resuspended in 1 mL of 50 mM Tris-HCl, pH 
8.0 for injection. The protein concentration was approximated by the comparison of a 10 
|LiL sub-sample to a series of bovine trypsin standards (Sigma) using SDS-PAGE and 
Coomassie staining. 

15 The cDNA clone 'HpF5' which encodes the NaPI-insensitive chymotrypsin (Figure 12) 
was PCR amplified essentially as described for clone HpF2B above except the forward 
primer (FwpMalRECH ) incorporated a Histidine tag at the N-terminus of the expressed 
protein and the reverse primer (RvRECH) contained a stop codon and thus prevented 
incorporation of a Histidine tag at the C-terminus from the pQE-60 expression vector. 

20 
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EXAMPLE 6 

Bioassays with Potl inhibitor and Helicoverpa larvae 

Large quantities of the Potl inhibitor were purified from potato tubers to evaluate the 
5 combined effect of NaPI and Potl on the growth of H.armigera larvae. Bioassays 
confirmed that addition of Potl significantly enhances the activity of the NaPI inhibitors. 
Caterpillars fed NaPI and Potl in combination (0.26 and 0.34% (w/v), respectively) were 
34% the size of control larvae at the fifth instar stage of development whereas caterpillars 
feeding on NaPIs alone were about 84% the size of the controls. Based on these results we 

10 decided to clone genes encoding the Potl inhibitors for transfer into cotton plants. Two 
Potl genes (PotIA and PotIB) were isolated using mRNA isolated from potato tubers and 
wounded potato leaves. These cDNA clones were used to construct vectors for bacterial 
expression of the Potl proteins. The bacterially produced Potl proteins totally inhibited the 
• NaPI-insensitive chymotrypsin isolated from H. punctigera larvae. Both Potl genes were 

15 incorporated into constructs for cotton transformation. Purified Potl inhibitor from potato 
tubers was used to generate specific antibodies that were used to monitor levels of Potl 
produced by transgenic plants. 

Experimental protocols 

20 

Purification of Potl inhibitor from potato tubers 

Potato tubers (2 Kg, Solanum tuberosum var Russet Burbank) were diced and soaked 
overnight at 4°C in two litres of 50 mM H 2 S0 4 . The tissue was homogenized in a blender 
and insoluble material was removed by filtration through two layers of Miracloth followed 

25 by centrifugation (13,000 rpm, 1 5 min, 4°C). The supernatant was adjusted to pH 7.8 with 
10 M NaOH, heated for 30 min in boiling water and cooled before precipitated material 
was separated by centrifugation (13,000 rpm, 12 min, 4°C). Soluble proteins were 
precipitated with ammonium sulphate (80% saturation), collected by centrifugation and 
redissolved in gel filtration buffer (150 mM KC1, lOmM Tris-HCl, pH 8) before they were 

30 applied to a Sephadex G75 column (85 x 2.54cm). Elution fractions (50mL) containing 
Potl were identified using SDS-polyacrylamide gel electrophoresis and inhibition assays 



WO 2004/094630 



PCT/AU2004/000524 



-91- 

with bovine chymotrypsin. PotI containing fractions were pooled, dialysed and freeze 
dried. The protein in these pooled fractions was examined by reverse phase -HPLC on a 
system Gold HPLC (Beckman, Fullerton, CA) coupled to a 166 detector (Beckman). The 
analytical RP-HPLC was conducted on a Brownlee Aquapore RP300 C8 column (4.6 x 
5 100mm; Perkin-Elmer). The protein was eluted with a linear gradient of 0-100% (v/v) 
buffer B (60%[v/v] acetonitrile in 0.089% [v/v] trifluoroacetic acid) at a flow rate of 1 mL 
min-1 over 40min (Figure 22). The identity of the PotI proteins was confirmed by N- 
terminal sequencing and mass spectrometry and at least two PotI isoforms were identified. 

1 0 Growth ofH. armigera larvae on artificial diet containing NaPI and PotI 

PotI protein isolated from potato tubers and NaPI peptides isolated from N. alata stigmas, 
were incorporated into a cotton leaf based artificial diet and fed to Karmigera neonates 
(Figure 23). PotI had an additive affect on larval growth. Larvae fed NaPI only were 
inhibited in growth by 16% compared to control larvae while larvae fed both NaPI and 

1 5 PotI were inhibited in growth by 75% compared to control larvae (Figure 23). 

Isolation ofcDNAs encoding Potato proteinase inhibitor I (PotI) 

Potato proteinase inhibitor I cDNA was synthesized using reverse transcriptase-PCR and 

total RNA from potato tubers or wounded potato leaves, First strand cDNA was prepared 

20 using Thermoscript RT-PCR with Oligo (dT) 2 o primers (Life Technologies). PotI cDNA 
sequences were subsequently amplified using gene specific primers 5' CGG-GAT-CCA- 
TGG-AGT-C AA- AGT-TTG-C-3 ' [SEQ ID NO:14] (sense) and S'-GCG-TCG-ACG-CTT- 
AAG-CCA-CCC-TAG-G-3' [SEQ ID NO: 15] (antisense) that were designed to anneal to 
the 5' and 3' ends of the open reading frame of the published PotI genomic sequence 

25 M17108 (Cleveland et al> Plant Mol Biol 8: 199-207, 1987) and included restriction sites 
Bam HI and Sal I respectively. 

Two PotI homologs were isolated. StPotIA was derived from wounded leaf RNA and 
StPotB was derived from tuber RNA. StPotIA and StPotiDB cDNA share 92.6% nucleic 
30 acid sequence identity and share 86% amino acid sequence identity (Figure 25). 
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The predicted amino acid sequence and comparison to other members of the Potato 
Inhibitor I family are presented in Figure 24. StPotIB is very similar to published PotI 
sequences from potato tuber and shares a methionine at the PI reactive site. In contrast, 
StPotIA contains an alanine residue at PI and also has an additional four amino acids at 
5 positions 41 to 44. Additional amino acids in this position have not been reported for other 
potato PotI isolates although they have been found in a wound induced PotI from tomato. 

Expression ofStPolIA andStPoHB in E. coli 

DNA encoding PotI without the endoplasmic reticulum signal sequence (amino acids 37- 
10 1 1 1 in StPotIA and 37-107 in StPotIB) was amplified by PCR. Primers used were 5'-CGG- 
GAT-CCA-AGG-AAT-CGG-AAT-CTG-3' [SEQ ID NO:16] (StPotIA sense), 5'-CGG- 
GAT-CCA-AGG-AAT-TTG-AAT-GC-3 1 [SEQ ID NO:17] (StPotIB sense) and 5'-CGA- 
GCT-CTT-AAG-CCA-CCC-TAG-G-3' [SEQ ID NO: 18] (StPotlA/B antisense). PCR 
products were initially cloned into the pGEM T-Easy vector (Promega) before they were 
15 excised with BamHL and Sacl and ligated into the bacterial expression vector pQE30 
(Qiagen) which provides a 6x His-tag at the N-terminus of the expressed protein. 

The His-tagged PotI proteins were expressed in E. Coli (BR21 DE3 Codon Plus strain 
(Stratagene) for StPotIA and Ml 5 strain (Qiagen) for StPotIB). Cells were induced with 1 

20 mM IPTG, harvested by centrifugation and lysed in 8 M urea, 0.1 M NaH 2 P0 4 , 0.01 M 
Tris-HCl, pH 8.0. Cell debris was removed by centrifugation at 10,000 g for 5 min and the 
His-tagged PotI was purified from the supernatant by metal-affinity chromatography on 
Talon resin (Clontech). Bound protein was eluted from the resin with 8 M urea, 0.1 M 
NaH 2 PO„, 0.01 M Tris-Cl, pH 4.0 and elution was monitored by SDS-PAGE. The StPotIA 

25 and StPotIB proteins were purified further by RP-HPLC (Figure 25) their identity was 
confirmed by N-terminal sequencing and mass spectrometry. 
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Inhibition of the NaPI-insensitive chymotrypsins from H. punctigera by StPotIA and 
StPotIB 

The inhibitory activity of StPotIA and StPotIB against the NaPI-insensitive chymotrypsins 
from H punctigera was determined by preincubating the NaPI-insensitive protease (10 \iL) 
5 with varying amounts of StPotIA and StPotIB (0-600 nM) in 133 mM CAPS buffer, pH 
10.0 at 30°C in 96 well microtitre plates. After the 30 min preincubation, the incubation 
was started by the addition of substrate (SA 2 PFpNA, SA 2 PLpNA or SA 2 PMpNA) to a final 
concentration of 1 mM in a final volume of 100 jxL. Absorbance was measured at 405 nm 
after 30 or 60 min, 

10 

StPotIA, StPotIB and the mix of PotI isoforms isolated from potato tuber inhibited the 
NaPI-insensitive chymotrypsin in the Helicoverpa punctigera gut extract (Figure 26). At 
least 75% of SA 2 PFpNA or SA 2 PLpNA hydrolysis by the NaPI-insensitive chymotrypsin 
was inhibited by the addition of 300 nM of StPotIA, StPotIB or a mix of PotI isoforms 
15 isolated from tuber. Inhibition of SA 2 PMpNA hydrolysis was lower (40%). 

Production of transgenic cotton expressing NaPJ and StPotIA 

Two gene constructs (pHEX2 and pHEX6) were prepared for transformation of cotton 
(Gossypium hirsutum). pHEX2 consists of a 35S promoter driving the NaPI gene with a 
20 35S terminator, inserted into the binary vector pBIN 19 (Bevan, Nucl. Acids Research, 12: 
871 1-8721, 1984). pHEX6 consists of a 35S promoter driving the StPotIA gene with a 35S 
terminator inserted into the binary vector pBIN 19. 

Transgenic cotton was produced using the method of Umbek et al., Biotechnology, 5: 263- 
25 266, 1987) with modifications. Hypocotyl sections of cotton Cv Coker 315 were co- 
cultivated with Agrobacterium tumefaciens strain LBA 4404 containing the required 
binary vector. Callus was induced on media consisting of MS salts, B5 vitamins, 3% 
glucose, 0.9 g/L MgCl 2 (hexahydrate), 1.9 g/L potassium nitrate, 2 g/L Gelrite, 0.1 mg/L 
Kinetin, 0.1 mg/L 2,4-D, 500 mg/L carbeniciilin, 35 mg/L Kanamycin. Embryogenic 
30 callus was induced by growing the callus on the same media but without hormones. 
Embryos were excised and incubated on media in petri dishes (Stewart and Hsu, Planta 
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137: 113-117, 1977). Germinated embryos that had produced roots and true leaves were 
transferred to containers for further development and then transferred to soil and grown in 
a growth cabinet at 27°C. 

5 Production of transgenic cotton expressing NaPI and StPotIA 

To produce plants expressing both genes, pollen from a transgenic line expressing NaPI 
was used to pollinate a flower from a plant expressing StPotIA and the seed collected. One 
progeny plant (plant 3) was identified as expressing both genes by immunoblot analysis. 

10 Leaves from plant 3 were used in a bioassay with H. armigera (Figure 27). While 
expression of either NaPI or StPotIA in the leaves only resulted in a small inhibition of 
larval growth compared to the control, expression of both proteins had a synergistic effect 
on larval growth. 

15 EXAMPLE 7 

Baculovirus expression ofHpChS 

Addition of the signal peptide sequence to HpChS 

While the HpF5 clone encoded the entire chymotrypsinogen sequence of HpCh5 with the 
20 activation domain it did not encode the signal peptide required for correct targeting of the 
protein to the endoplasmic reticulum (ER). For baculovirus expression, an ER signal 
sequence was added to the HpF5cDNA using two overlapping oligonucleotides 
corresponding to the 19 amino acids of the ER signal peptide from HpCh2A (Figure 10). 
The ER signal sequence was added to the preactivation and mature domains of HpChS in a 
25 two-step PCR (Table 9). 
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TABLE9 

PGR amplification primers for addition of an ER signal sequence to HpChS 



.Primer 


• ■•' .. ■ "'' Sequenced •>.?•• s. n-iv—-- 
•:j-.?v .• • •• . ■ . . ">• -.;;■:•.: ...»• •. c •• 


FWBacRECHl (5'-3') 


TTG GCT TTC GCC GCG GTC GTC TCC GCG AGG AAC 
GGG TCC C rSEQ ID NO: 191 


FWBacRECH2 (5'-3') 


GGA TCC ATG AAA CTC TTG GCT GTG ACT CTA TTG 
GCT TTC G rSEQ ED NO:20] 


RvRECH(3'-5') 


G ATC AAC GGC CAG CTC TAA AAG CTT [SEQ ED 
NO:21] 



5 The first PCR used primers FwBacRECHl and RvRECH with the HpF5 cDNA template to 
add the first half of the ER signal sequence. The second PCR used FwBacRECH2 together 
with RvRECH and the product of the first PCR reaction as template. At each step, the 
amplification products were purified after electrophoresis on 0.7% (w/v) agarose gels 
before they were used for subsequent PCRs. 

10 

Cloning of HpFS/ER into pFastBac vector 

The HpF5 cDNA with the ER signal sequence (HpF5/ER) was subcloned into the pCR 
(registered trademark)-2.1 TOPO vector (Invitrogen) and was sequenced at the Micromon, 
DNA sequencing facility, Monash University, Victoria, Australia. Recombinants with the 
1 5 correct sequence were digested with EcdKL and gel purified before they were digested with 
BamYQ. and Hindia and ligated into the pFastBac vector (Invitrogen) and transformed into 
E. coliXLl Blue cells. 

Transposition of pFastBac/HpF5/ER construct into E. coliDMOBac cells 
20 E. coli XL1 Blue cells were screened for the presence of the HpF5/ER cDNA in the 
pFastBac vector (pFastBac/HpF5/ER) by PCR and restriction digest. Minipreps were 
performed on positive transformants. E. coli DHlOBac competent cells containing bacmid 
DNA and the helper plasmid required for transposition of HpF5/ER to the bacmid DNA 
were thawed on ice. Approximately 1 ng of pFastBac/HpF5/ER recombinant plasmid was 
25 added to the cells (150 uL) and after gently mixing the mixture was transferred to a pre- 
chilled GENE PULSER (registered trademark)/£. COLI (trademark) pulser cuvette. The 
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cuvette was placed in the electroporation apparatus (BioRad) and a pulse of 1.7 Amps was 
applied. LB (1 mL) was then added and the sample was transferred to a 10 mL capped tube 
and allowed to recover on a shaking incubator (190 rpm) at 37°C for 4 hours. A sample 
was withdrawn and serially diluted (10' 1 , 10" 2 , and 10°) using LB medium before 100 ^iL 
5 of each dilution was spread evenly onto LB agar plates containing 50 \xg/mL kanamycin, 7 
jxg/mL gentiamicin, 10 (xg/mL tetracycline, 100 (xg/mL 5-bromo-4-chloro-3-indolyl-p-D- 
galactopyranoside (X-Gal) and 40 \xg/mL IPTG prior to incubation for 48 hours at 37°C. A 
pFastBac vector with no insert was treated the same way and used as a control. 

1 0 Isolation of recombinant bacmid DNA 

Recombination between the pFastBac/HpF5/ER vector and the bacmid DNA was detected 
by the presence of white colonies as apposed to blue colonies, indicating a disruption in the 
lacZa gene. To confirm the recombination was stable, the colonies were restreaked on 
selective media and incubated for 24 hours. White colonies were used to inoculate 10 mL 

15 LB media containing 50 ^g/mL kanamycin, 7 jxg/mL gentiamicin and 10 |4g/mL 
tetracycline for isolation of the recombinant bacmid DNA. The same procedure was 
performed with a blue colony (not recombinant) selected as a control. Cells were grown 
overnight at 37°C with shaking at 190 rpm. Cultures were centrifuged at 14,000 x g for 5 
min. The supernatant was discarded and the pellet was resuspend in 0.3 ml of Solution I 

20 (15 mM Tris-HCl, pH 8.0, 10 mM EDTA, 100 |^g/mL RNase A]. Solution II (0.2 N 
NaOH, 1% (w/v) SDS) was then added and gently mix and incubated at room temperature 
for 5 min. Potassium acetate (0.3 mL, 3 M, pH 5.5) was slowly added and the samples 
were placed on ice for 10 min before, they were centrifuged for 10 min' at 14,000 x g to 
remove the thick white precipitate. The supernatant was then transferred to a 1.5 mL 

25 microfiige tube containing 0.8 ml absolute isopropanol and was mixed by inversion. After 
10 min on ice samples were centrifuged for 15 min at 14,000 x g at room temperature. The 
supernatant was removed and pellet was washed twice with 0.5 mL of 70% (v/v) ethanol. 
The sample was then centrifuged for 5 min at 14,000 x g at room temperature before the 
70% (v/v) ethanol was removed and the DNA pellet was air dried. The DNA was then 

30 dissolved in 60 ^L TE buffer (10 mM Tris-HCl, 1 mM EDTA, pH 8.0) and stored at -20°C 
until needed. The bacmid was examined by electrophoresis on a 0.5% (w/v) agarose gel 
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prepared in TAE (40 mM Tris-acetate, 1 mM EDTA, pH 8) buffer at 23 V for 12 hours 
with X DNA///wrfni Fragment markers (MM Fragments). The bacmid was also screened 
using PCR with Ml 3 Forward and reverse primers. 

5 Transfection of HIGH FIVE (trademark) cells with recombinant bacmid DNA 

Approximately 9 x 10 s HIGH FIVE (trademark) insect cells were placed into a 35-mm 
well in a 6-well plate (NUNCLON (trademark) A Surface) in 2 ml of EX-CELL 
(trademark) 405 media with 50 units/mL penicillin and 50 ug/mL streptomycin (JRH 
Biosciences). Cells were left at 27°C for 1 hour to allow them to attach to the plate. Two 

10 solutions were prepared for each transfection each adding a different amount of bacmid 
miniprep (5, 10 or 20 uL); Solution A: mini-prep of bacmid DNA into 100 uL EX-CELL 
(trademark) 405 media without antibiotics, and Solution B: 6 uL of CELLFECTIN 
(registered trademark) Reagent (Gibco BRL) into 100 uL EX-CELL (trademark) 405 
media without antibiotics. Solutions A and B were combined and incubated for 1 hour at 

15 room temperature followed by 0.8 mL of selection free media added. After the cells had 
attached to the plate, they were washed once with 2 mL media without antibiotics. The 
media was removed from cells and was replaced with the DNA containing solution. Cells 
were incubated for 5 hours in a 27°C incubator before the transfection mixture was 
aspirated and replaced with 2 mL of media containing antibiotics. Cells were incubated 

20 further at 27°C for up to 72 hours. 

After 24, 48 and 72 hours, 2 mL of the supernatant was transferred to a 10 mL sterile 
capped tube and centrifuged for 5 min at 500 x g. The virus-containing supernatant was 
transferred to a fresh tube and stored at 4°C, protected from light until required. 

25 

Infection of insect cells with recombinant baculovirus particles 

Two 10 mL cultures were set up for each transfection. Each culture was infected with 100 
ul of harvested virus (~2 x 10 7 pfu/mL). Infected cells were harvested at 24, 48, and 72 
hours. These samples were later analyzed for protein expression by immunoblot analysis 
30 with the anti-HpCh5 antibody. 
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EXAMPLE 8 

Baculovirus expression ofHPCHS 

Primer design for ER signal sequence 
5 The original HpF5 cDNA clone did not encode an ER signal sequence. While the ER 
signal sequence was not required for bacterial expression it was essential for baculovirus 
expression. An ER signal sequence was thus constructed using the DNA sequence from 
chymotrypsin family 2A which is most closely related to the chymotrypsin family 5. 

10 The sequence was added using PCR reactions with two primers encoding part of the ER 
signal sequence. The FwBacRECHl primer had a silent mutation to remove the BamlU 
restriction digest site and encoded half the ER signal sequence. The FwBacRECH2 primer 
encoded the remainder of the ER signal sequence and introduced a BctrnHl restriction site 
(Figure 28). 

15 

Amplification and ligation ofHpChS into the pFastBac vector 

PCR using primers the FwBacRECHl and RvRECH and HpF5cDNA as the template 
yielded a product of ~900 bp. A second PCR using this PCR product together with 
FwBacRECH2 and RvRECH primers was performed to yield a product of ~930 bp that 

20 encoded the HpCh5 protein with an ER signal sequence. The amplified product was 
subcloned into pCR (registered trademark)-2.1 TOPO vector (Invitrogen) before 
transfection into TOP10 competent cells. Colonies were screened for the presence of insert 
using M13 forward and reverse primers. Restriction digests of the isolated plasmids with 
EcoKL then subsequently with BamHL and HindlE yielded a product of the expected size of 

25 -930 bp. 

The ~930 bp fragment was subsequently ligated into the pFastBac vector to create 
pFastBac/HpF5/ER. The presence of the HpF5 cDNA insert in the pFastBac vector was 
confirmed by restriction digests with BamYQ. and HirvSH. and PCR using FwBacRECH2 
30 and RVRECH primers (Table 9). Plasmids containing the insert were sent to Micromon for 
DNA sequencing. 
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Analysis o/BacmidDNA 

The transfection of the recombinant pFastBac vector into DHlOBac cells resulted in the 
transposition of the HpF5 insert into the bacmid DNA. Recombinant bacmid DNA was 
5 isolated from the transfected cells and was separated on a 0.5% (w/v) agarose gel. 

The bacmid was checked for the HpF5 cDNA insert by PCR analysis with M13 forward 
and reverse primers. The expected -3300 bp fragment was further analysed using the 
primers to HpF5/ER cDNA and Ml 3 to ensure the insert was in the correct orientation. 
10 The positive bacmid DNA was subsequently used for the transfection of insect cells. 

Virus formation and production ofHpChS in the baculovirus expression system 
Three different concentrations of bacmid DNA were used to transfect the insect cells for 
production of baculovirus. After transfection, the cells were incubated at 27°C for 72 hours 
15 before the culture medium containing the virus was collected. The medium was tested by 
immunoblot analysis with the a-HpCh5 antibodies for the production of HpChS protein, 
which indicated virus was being produced (Figure 29A). Many controls were used to 
monitor the transfection. These included pFastBac vector alone, transfection of DNA from 
a blue colony, incubation of cells with CELLFECTIN (registered tradmark) alone and non- 
20 transfected cells (Figure 29A). The medium containing virus was used in a subsequent 
experiment to infect more insect cells on a larger scale and a time course was performed 
over a 72 hour period to monitor the regulation of HpCh5 expression (Figure 29B). 

Those skilled in the art will appreciate that the invention described herein is susceptible to 
25 variations and modifications other than those specifically described. It is to be understood 
that the invention includes all such variations and modifications. The invention also 
includes all of the steps, features, compositions and compounds referred to or indicated in 
this specification, individually or collectively, and any and all combinations of any two or 
more of said steps or features. 
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CLABMS 

1. An isolated nucleic acid molecule comprising a sequence of nucleotides 
encoding a chymotrypsin from Helicoverpa spp. or a variant, derivative, homolog or 
analog of said chymotrypsin, wherein said chymotrypsin exhibits resistance to a proteinase 
inhibitor (PI) from Nicotiana data. 

2. The isolated nucleic acid molecule of Claim 1 wherein the nucleotide sequence 
encodes an amino acid sequence set forth in SEQ ID NO:2 or an amino acid sequence 
having at least about 75% similarity to SEQ ID NO:2 after optimal alignment. 

3. The isolated nucleic acid molecule of Claim 2 wherein the nucleotide sequence 
encodes an amino acid sequence set forth in SEQ ED NO:2. 

4. The isolated nucleic acid molecule of Claim 1 wherein the nucleotide sequence 
is as set forth in SEQ ID NO:4 or SEQ ID NO:6 or a nucleotide sequence having at least 
about 75% identity to SEQ ID NO:4 or SEQ ID NO:6 after optimal alignment or a 
nucleotide sequence capable of hybridizing to SEQ ID NO:4 or SEQ ID NO:6 or its 
complementary form under low stringency conditions. 

5. The isolated nucleic acid molecule of Claim 4 comprising a nucleotide 
sequence as set forth in SEQ ID NO:4. 

6. The isolated nucleic acid molecule of Claim 1 encoding a variant of the 
chymotrypsin wherein the variant is an N-terminal signal sequence comprising an amino 
acid sequence set forth in SEQ ID NO:3 or an amino acid sequence having at least about 
75% similarity to SEQ ID NO:3 after optimal alignment. 

7. The isolated nucleic acid molecule of Claim 6 wherein the nucleotide sequence 
is set forth in SEQ ID NO:5 or a nucleotide sequence having at least 75% identity to SEQ 
ID NO:5 after optimal alignment or a nucleotide seuence capable of hybridizing to SEQ ID 
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NO:5 or its complementary form under low stringency conditions, 

8. The isolated nucleic acid molecule of Claim 1 encoding a variant of the 
chymotrypsin wherein the variant comprises an amino acid other than arginine at position 
192. 

9. The isolated nucleic acid molecule of Claim 8 wherein the variant comprises a 
glutamine at position 192. 

10. A vector comprising a nucleic acid molecule of any one of Claims 1 to 9. 

1 1 . The vector of Claim 8 wherein the vector is an expression vector. 

12. The vector of Claim 10 wherein the expression vector is operable in a 
prokaryotic cell. 

13. The vector of Claim 11 wherein the experssion vector is operable in a 
eukaryotic cell. 

14. The vector of Claim 1 3 wherein the eukaryotic cell is an insect cell. 

15. The vector of Claim 14 wherein the vector is a baculovirus vector. 

16. A genetically modified cell comprising a nucleic acid molecule of any one of 
Claims 1 to 9. 

1 7. The genetically modified cell of Claim 1 6 wherein the cell is a prokaryotic cell. 

1 8. The genetically modified cell of Claim 1 6 wherein the cell is a eukaryotic cell. 

19. An isolated chymotrypsin from Helicoverpa spp. wherein said chymotrypsin 
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exhibits resistance to a PI from N. alata or a variant, derivative, homolog or analog of said 
chymotrypsin. 

20. The isolated chymotrypsin of Claim 19 comprising an amino acid sequence set 
forth in SEQ ED NO:2 or an amino acid sequence having at least about 75% similarity to 
SEQ ID NO:2 after optimal alignment. 

21. The isolated chymotrypsin of Claim 20 comprising an amino acid sequence as 
set forth in SEQE)NO:2. 

22. The isolated chymotrypsin of Claim 19 encoded by a nucleotide sequence 
having at least about 75% identity to SEQ ED NO:4 or SEQ ID NO:6 after optimal 
alignment or a nucleotide sequence capable of hybridizing to SEQ ID NO:4 or SEQ ID 
NO:6 or its complementary form under low stringency conditions. 

23. The isolated chymotrypsin of Claim 22 encoded by a nucleotide sequence set 
forth in SEQ ID NO:2. 

24. The isolated chymotrypsin of Claim 19 wherein the variant is an N-terminal 
signal sequence. 

25. The isolated chymotrypsin of Claim 24 wherein the signal sequence comprises 
an amino acid sequence as set forth in SEQ ID NO:3 or an amino acid sequence having at 
least 75% similarity thereto after optimal alignment. 

26. The isolated chymotrypsin of Claim 19 encoded by a variant comprises an 
amino acid other than arginine at position 192. 

27. The isolated chymotrypsin of Claim 19 encoded by a variant comprises a 
glutamine at position 192. 
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28. The isolated chymotrypsin encoded by a nucleic acid molecule of any one of 
Claims 1 to 9. 

29. An antagonist of a chymotrypsin as defined in any one of Claims 1 9 to 28. 

30. The antagonist of Claim 29 wherein the antagonist binds or interacts with the 
chymotrypsin at or near amino acid residue position 192, 

3 1 . The antagonist of Claim 29 or 30 wherein said antagonist is Potl. 

32. A composition comprising an antagonist of Claim 29 or 30. 

33. An anti-insect composition comprising an antagonist of Claim 29 or 30. 

34. A genetically modified plant comprising cells capable of producing an 
antagonist of a chymotrypsin as defined in any one of Claims 19 to 26 or expression of a 
nucleic acid molecule as defined in any one of Claims 19 to 26. 

35. A genetically modified plant comprising cells capable of producing an 
antagonist of chymotrypsin-HpChS or HpF5 gene expression. 

36. The genetically modified plant of Claim 34 or 35 wherein the plant is a 
monocotyledonous plant. 

37. The genetically modified plant of Claim 34 or 35 wherein the plant is a 
dicotyledonous plant. 

38. The genetically modified plant of Claim 34 or 35 wherein the plant is a crop 
plant, vegetable plant or ornamental plant or tree. 

39. The genetically modified plant of Claim 34 or 35 wherein the plant products 
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Potl. 

40. The genetically modified plant of Claim 37 wherein the plant is cotton, sweet 
corn, tomato, tobacco, piniento, potato, sunflower, citrus, plums, sorghum, leeks, soybean, 
alfalfa, beans, pidgeon peas, chick peas, artichokes, curcurbits, lettuce, Dianthus (an 
ornamental plant), geraniums, cape gooseberry, maize, flax and linseed, alfalfa, lupins, 
broad beans, garden peas, peanuts, canola, snapdragons, cherry, sunflower, pot marigolds, 
Helichrysum (an ornamental plant), wheat, barley, oats, triticale, carrots, onions, orchids, 
roses and petunias. 

41 . The genetically modified plant of Claim 40 wherein the plant is a cotton plant. 

42. The genetically modified plant of any one of Claims 34 to 41 comprising a 
nucleic acid molecule encoding Potl A and/or PotlB. 

43. Seeds or other reproduction material from the plant of any one of Claims 34 to 
42. 

44. A method for modulating activity of the HpChS or a homolog or variant 
thereof in an insect, said method comprising contacting the HpChS protein or its homolog 
or variant with an effective amount of an agent for a time and under conditions sufficient 
to decrease or increase HpChS activity. 

45. A method for modulating expression of HpF5 or homolog or variant in an 
insect, said method comprising contacting HpF5 or its homolog or variant with an effective 
amount of an agent for a time and under conditions sufficient to decrease or increase HpF5 
expression. 

46. A method is provided for the isolation of and separation of individual isofonns 
of chymotrypsin, said method consisting of: 
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(i) affinity chromatography of insect gut extracts initially with 
benzamidine sepharose to bind trypsins; 

(ii) further affinity chromatography of the unbound proteins using 
immobilized N. alata serine proteinase inhibitor CI to bind all NaPI 
inhibitable chymotypsins; and 

(iii) affinity chromatography of the eluate from (ii) with immobilized PotI 
and Potll or chymostatin to bind the remainder. The putative NaPI- 
insensitive chymotrypsins are then eluted with 8 M urea. 



47. A method for screening for an antagonist of a NaPI-insensitive chrymotrypsin 

from Helicoverpa spp said method comprising contacting same chrymotrypsin with a 
potential antagonist and screening for chrymotrypsin activity. 



48. 



An isolated inhibitor identified by the method of Claim 47. 
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Low protein haricot bean artificial diet 
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Figure 2B 
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Figure 2C 
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Figure 2E 
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NaPI resistant chymotrypsin activity tested at a range of 
pH's and substrates 
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CAA72966 MKLLAVTLLAFAAWSARNIDLEDVIDLEDITAYDYHTKIGIPLAEKIRA 

CAA72959 LAVTLLAFAAWSARNIDLEDVIDLEDITAYDYHTKIGIPLAEKIRA 

CAA72960 MKLLAVTLLAFAAIVSARNIDLEDVIDLEDITAYDYHTKIGIPLAEKIRA 

CAA72958 INHEAWDLEDITAYGYHTKVGIPLAEEIRI 

CAA72952 MKLFLGVCLTLAVAVSAVEIATPD ADSPVFGYHAKFGIAEAARIKS 

CAA72951 

FWY79-* FWGl-» 

CAA72966 AEEEAERNPSRIVGGSTSSLGAF lPYQAGLl LATFASGQGVCGGSLLNNRRV 

CAA72959 AEEEAERNPSRIVGGSTSSLGAF PYQAGLL ATFASGQGVCGGSLLNNRRV 

CAA72960 AEEEAERNPS RI VGG S I S S LG A F|P YQAGQ lATFASGQGVCGG S L L N N RRV 

CAA72958 AELEASRNPSRIVGGS lSASLG^FPYQAGL LINLPLGQSVCGGSLLNQRRV 

CAA72952 AEEVQSFNGQRIVGGSITNIANV lPYQAGLl VITIFIFQSVCGASLISHNRL 

CAA72951 HNKWV 

CAA72966 LTAAHCWFDGRNQARSFTWLGSVRLFSGGTRLNTASWMHGSWNPNLIR 

CAA72959 LTAAHCWFDGRNQARS FTWLGSVRL FSGGTRLNTAS WMHGSWN PN LI R 

CAA72960 LTAAHCWFDGRNQARS FTWLGSVRLFSGGTRLNTASWMHGSWNPNLIR 

CAA72958 LTAAHCWFDGRNQANSLTVILGSINLYFGGTRLNSNSWMHGSWNPNLIR 

CAA72952 VTAAHCKSDGVLTANSFTWLGSNTLFFGGTRINTNDWMHPNWNPNTAA 

CAA72951 LTAAHCLANRITFWRFGLTNLTRPEILVESANKYIHPDYDEIRAG-VQT 

CAA72966 NDIAMINLPSNVATSGNIAP — IALPSGNELNNNFNGATAVASGFGLARD 

CAA72959 NDIAMINLPSNVATSGNIAP — IALPSGNELNNNFNGATAVASGFGLARD 

CAA72960 NDIAIINLPSNVATSGNIAP — IALPSGNELNNNFNGATAVASGFGLAND 

CAA72958 NDIAIINLPSNVGTSNNIAP — IALPSGNELNNQFAGFTATASGFGRTRD 

CAA72952 NDIAVLRISS-VSFSNVIQP — IALPSGDELNNLFVGANALASGFGRTSD 

CAA72951 ADLALVGLDHHIEYSANVQPSRLMSSAQKNINYE — GIQMIVSGFGRTDD 

FWY72-> 

CAA72966 GGSVDGN LRHVNLPVITNAVCTVSFP-GIIQSS-NICTSG — ANGRS 

CAA72959 GGSVDGN LRHVNLPVITNAVCTV5FP-GIIQSS-NICTSG — ANGRG 

CAA72960 GGSVDGN LRHVNLPVITNAVCTVSFP-GIIQSS-NICTSG — ANGRS 

CAA72958 GGSVSPT LNHVNLPVITNNVCWQSFP-LYIQSS-NICTSG — ANGRS 

CAA72952 SGSIGTN-QQLS SVTIPVIT NAQCAAVYGSGFVHAS-NICTSG — AGGKG 

CAA72951 LWNGGAASEIL UWVYQRGVI SNEECLRWYPTSQVIKEETICAGYWDNPSQS 

<-RVG4 

CAA72966 TCQGDSGGPLWTSN-NRRILIGVTSFGSARGCQVGS 

CAA72959 TCQGDSGGPLWTSN-NRRILIGVTPFGSARGCQVGS 

CAA72960 TCQGDSGGPLWTSN-NRRILIGVTSFGSARGCQVGS 



PAAFARVTSFISW 



CAA72958 TCQGDSGGPLVVTSN-NRRILIGVTSFGSDRGCQVGAPAAFARVTSYISW 



CAA72952 TCNGDSGGPLAVDSN-NRKI LIGVTS YGAQAGCAAG F^MFARjVTS FVDW 



PAAFARVTSFISW 



PAAFARVTSFISW 



CAA72951 SCQGDSGGPLTIIDADGERTQVGIVSFGSTAGCNSPFPSGYVRPGHYHS 
4-RVY72 

CAA72966 INQRL - - 

CAA72959 INQRL 

CAA72960 INNLL 

CAA72958 INQRL 

CAA72952 VQSQ 

CAA72951 [FTEVh TGINFDWDSDAIIPGSSESSSSESSEPSSESSESSSEEDGANPSSS 

CAA72966 

CAA72959 

CAA72960 - 

CAA72958 

CAA72952 — - 

CAA72951 EEDGSNPSSEEDAGSPPSEEEEAPEKVRWEY 
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Primer Name 


Sequence 5'-3' V ! 


FWG1 


TCC CTT ACC AGG C(GCT) GTC [SEQ ID NO: 38] 


RVG4 


TCT GGC GAA GGC AGC AGG [SEQ ID NO: 39] 


i Y79FW 


CTG CTA GCC TCG GAC AAT TC [SEQ ID NO: 40] 


Y72Fw 


CTG GAG TGC AGA CTG CTG AC [SEQ ID NO: 41] 


Y72Rv 


GGA TGA TGG CGT CGC TGT CC [SEQ ID NO: 42] 
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HpFlApcr 

HpFlBpcr 

HpF2Apcr 

HpF3pcr 

HpF4pcr 

HpFlApcr 

HpFlBpcr 

HpF2Apcr 

HpF3pcr 

HpF4pcr 

HpFlApcr 

HpFlBpcr 

HpF2Apcr 

HpF3pcr 

HpF4pcr 

HpFlApcr 

HpFlBpcr 

HpF2Apcr 

HpF3pcr 

HpF4pcr 

HpFlApcr 

HpFlBpcr 

HpF2Apcr 

HpF3pcr 

HpF4pcr 



]VITI FI FQSVCGASLI PHNRLVTAAHCKSDGVLTANS FTWLGS 50 

jVITIFIFQSVCGASLISHNRLVTAAHCKFDGVMTANSFTWLGS 50 

JLANFASGQGVCGGSLLNQRRVLTAAHCWFDGRNQARS FTWLGS 50 

IaslgqfI pyqagllinlplgqsvcggsllnqrrvltaahcwfpgrnqatsltvilgs 57 



NTLFFGGTRINTNDWMHPNWNPSTAANDIAVMRISS-VSFSNVIQPIAL 99 
NTLFFGGTRINTNDWMHPNWNPSTVANDIAVIRISS-IVFNNVIQPIAL 99 
VRLFSGGTRLDTASWMHGSWNPNLIRNDIAMINLPSNVATSGNIAPIAL 100 
INLFFGGTRLNSNSWMHGSWN PNLIRNDI AIINLPSNVGTSGNIAPIAL 100 
1SGVQTADI LALVGLDQEIEYSANVQPSRL 28 

PSGDELNNLFVGANALASGFGRTSDGGSIG5NQQ-VSSVTIPVITNDECA 148 
PSGDELNN L FVGANALASG FG RTSDSGG IGTNQQ- LS SVTI PVITN AECA 148 

PSGNELNNNFNGATATA5GFGLARDGGSVDGN LRHVNLPVITNAVCT 147 

PSGNELNNQFAGFTATASGFGLTRDGGNVSPT LNHVNL PVITN NVCW 147 

MSSAQKNINYEGIQMIVSGFGRTDDLWNGGAASEILLWVYQRGVSNEECL 78 

AVYGS-AFVHSSNICTSGAGG — KGTCNGDSGGPLAIDSNN-EKILIGVT 194 
AVYGP-AFVHDTNICTSGAGG — KGTCNGDSGGPLAVDSND-KKILIGVT 194 

VSFP — GIIQSSNICTSGANG — RSTCQGDSGGP 177 

QSFP — LYIQSTNICTSGANG — RGTCQGDSGGPLVVTSNN-RRILIGVT 192 
RWYPTSQVIKEQTICAGYWDNPSQSSCQGDSGGPLTIIDADGERTQVGIV 128 

S YGAQAGCAAG UPAAFARKI 213 

S YGAADGC AAG F jPAAS P ER| 213 

S FG S DRGCQVGA |P AA FA R| 210 

S FG STAGCN S P F P SG YV R PG H YH D W FTEVTG I N FDW|DSDAII] 120 
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HpChlAI 

HpChlBI 

HpCh2A 

HpCh2B 

HpCh3A 

HpCh4I 

HpCh4lI 

HpChlAI 

HpChlBI 

HpCh2A 

HpCh2B 

HpCh3A 

HpCh4l 

HpCh4lI 



HpChlAI 

HpChlBI 

HpCh2A 

HpCh2B 

HpCh3A 

HpCh4l 

HpCh4lI 



HpChlAI 

HpChlBI 

HpCh2A 

HpCh2B 

HpCh3A 

HpCh4l 

HpCh4lI 



HpChlAI 

HpChlBI 

HpCh2A 

HpCh2B 

HpCh3A 

HpCh4l 

HpCh4lI 



HpChlAI 

HpChlBI 

HpCh2A 

HpCh2B 

HpCh3A 

HpCh4l 

HpCh4lI 

HpChlAI 

HpChlBI 

HpCh2A 

HpCh2B 

HpCh3A 

HpCh4l 

HpCh4lI 



AVSA VEIGTPDAD SPVFGYHAKF GIPEAARIKS 

MKLFLGVCLA LAVAVSA VEIGTPEAG SPVFGYHAKF GIAEAARIKS 

MKLLAVTLLA FAAW5ARNI DLEDVIDLED ITAYDYHTKI GIPLAEEIRA 
MKLLAVTLLA FAAVVSARNI DLEDVIDLED ITAYDYHTKI GIPLAEKIRA 



MAAAYLLGLL 
MAAAYLLGLL 

AEEVQSFNGQ 
AEEVQSFNGQ 
AEEEAERDPS 
AEEEAERNPS 

— EFSSG — S 
--EFSSF--S 

LVTAAHCKSD 
LVTAAHCKFD 
VLTAAHCWFD 
VLTAAHCWFD 
VLTAAHCWFD 
VLTAAHCLAN 
VLTAAHCLAN 



FVLGYVQGGL 
FVLGYVQGGL 

ii 

RIVGGSITDI 
RIVGGSITNI 
RIVGGSTSSL 
RIVGGSTSSL 
-IVGGSSASL 
RIVAGWPAVE 
RIVAGWPAVE 



GVLTANSFTV 
GVMTANSFTV 
GRNQARSFTV 
GRNQARSFTV 
GRNQATSLTV 

RITFW 

RITFW 



LNADPAIIED LRDA- 
LNADPAIIED LRDA- 



ANVPYQAGLV 
ANVPYQAGLV 
GAFPYQAGLL 
GAFPYQAGLL 
GQFPYQAGLL 
GQIPYQGSLR 
GQIPYQGSLR 



VLG5NTLFFG 
VLGSNTLFFG 
VLG5VRLFSG 
VLGSVRLYSG 
ILGSINLFFG 
RFGLTNLTRP 
RFGLTNLTRP 



ITIFIF-QSV 
ITIFIF-QSV 
ANFASG-QGV 
ASFASG-QGV 
INLPLG-QSV 
MVSAIGGVSS 
MV5AIGGVSS 



GTRINTNDW 
GTRINTNDW 
GTRLDTASW 
GTRLNTASVV 
GTRLNSNSW 
EILVESTNKY 
EILVESTNKY 



MHPNWNPS — 
MHPNWNPS — 
MHGSWNPN — 
MHGSWNPN — 
MQGSWNPN — 
IHPEYDEIRA 
IHPEYDEIRA 



33 
46 
50 
50 

34 
34 



CGASLISHNR 82 

CGASLISHNR 95 

CGGSLLNQRR 99 

CGGSLLNVRR 99 

CGGSLLNQRR 38 

CGCSLIHNKW 80 

CGCSLIHNKW 80 



130 

143 

147 

147 

86 

126 

126 



-TAANDIAVM RISS-VSFSN VIQPIALPSG DELNNLFVGA NALASGFGRT 178 

-TVANDIAVI RISS-IVYNN VIQPIALPSG DELDNLFVGA NALASGFGRT 191 

-LIRNDIAMI NLPSNVATSG NIAPIALPSG NELNNNFNGA TATASGFGLA 196 

-LVRNDIAMI NLPSNVATSG NIAPIALPSG NELNNQFAGA TATASGFGLA 196 

-LIRNDIAII NLPSNVGTSG NIAPIALPSG NELNNQFAGF TATASGFGLT 135 

GVQTADLALV GLDHEIEYSA NVQPSRLMSS AQKNINYEGI QMIVSGFGRT 176 

GVQTADLALV GLDQEIEYSA NVQPSRLMSS AQKNINYEGI QMIVSGFGRT 176 

SDGGSIGSNQ Q-VSSVTIPV ITNDECAAVY GS-AFVHSSN ICTSGAGG-- 224 

SDSGGIGTNQ Q-LSSVTIPV ITNAECAAVY GP-AFVHDTN ICTSGAGG-- 237 

RDGGSVDGN LRHVNLPV ITNAVCTVSF P — GIIQSSN ICTSGANG-- 239 

RDGGVIDGN LRHVNLPV ITNAVCSQSF P—GLIQASN VCTSGANG 239 

RDGGNVSPT LNHVNLPV ITNNVCWQSF P — LYIQSTN ICTSGANG-- 178 

DDLWNGGAAS EILLWVYQRG VSNEECLRWY PTSQVIKEQT ICAGYWDNPS 226 

DDLWNGGAAS EILLWVYQRG VSNEECLRWY PTSQVIKEQT ICAGYWDNPS 226 

¥ • T § • # 

KGTCNGDSGG PLAVDSNNEK IL-IGVTSYG AQAGCAVG LPAAFAR 268 

KGTCNGDSGG PLAVDSNDKK IL-IGVTSYG AADGCAAG FPAAFAR 281 

RSTCQGDSGG PLWNSNNRR IL-IGVTSFG SARGCQVG SPAAFAR 283 

RSTCQGDSGG PLWNSNNRR IL-IGVTSFG SARGCQVG SPAAFAR 283 

RGTCQGDSGG PLVVTSNNRR IL-IGVTSFG SDRGCQVG APAAFAR 222 

QSSCQGDSGG PLTIIDADGE RTQSRYCELR — IHCWNA AHSPQGYVR 272 

QSSCQGDSGG PLTIIDADGE RTQVGIVSSD PLLDATVHSP RVTSPGHYHD 276 

VTSFVSWVQS Q 279 

VTSFVSWVQS Q 29^ 

VTSFISWINQ RL 295 

VSSYISWINQ RL 295 

VTSYISWINQ RL 234 

PGHYHDWFTE VTGINFDWDS DAIIP "Z"~" \Va 

GHRGDRHQLR LGQRRHYPDS SESSLRAAIL PLESSRAFIR RNQSSFRGGL 326 
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HpChlAI 279 

HpChlBI 292 

HpCh2A - - 295 

HpCh2B - 295 

HpCh3A - - -- 234 

HpCh4l - — 297 

HpCh4lI CQPPRFPTRT VPTHLPRRTL AAPPSEEEEA PEKVRWEY 365 
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[SEQ ID NO: 55] Rechla IVGGSLSjSVGj QIPjYQAGLjyg JDLAS^QAV-C GGSLISA 

[SEQ ID NO: 56] Rechlb IVGGSTSSVG QFPYQAGLLA SFAGGQAV-C G 

[SEQ ID NO: 57] Famllyla IVGGSITDIA NVPYQAGLVI TIFIFQSV-C GASLISHN 

[SEQ ID NO: 58] Famxlylb IVGGSITNIA NVPYQAGLVI TIFIFQSV-C GASLISHN 

[SEQ ID NO: 59] Family2a IVGGSTSSLG AFPYQAGLLA SFASGQGV-C GGSLLNVR 

[SEQ ID NO: 60] FamllyZb IVGGSTSSLG AFPYQAGLLA NFASGQGV-C GGSLLNQR 

[SEQ ID NO: 61] Family3 IVGGSSASLG QFPYQAGLSL IY-SGQSV-C GGSLLNQRR 

[SEQ ID NO: 62] Fam±ly4 IVAGWPAVEG QIPYQGSLRM VSAIGGVSSC GCSLIHN 



Figure 11 A 





N-terrainal sequence of resistant chymotrypsin (Rech 1 a) 


Fl F2 

IVGGSLSgyg^ffiYQAGL^I^^QAVCGGSLISA [SEQ ED NO:9] 


Primer Name 


Oligonucleotide sequence 5'-3' 


Fw2ResChy = Fl 


TC(AGCT) GT(AGCT) GG(AGCT) CA(AG) AT(ACT) CC [SEQ ID NO: 10] 


FwResChym = F2 


GT(AGCT) AT(ACT) GA(CT) CT(AGCT) GC(AGCT) GG(AGCT) GG [SEQ ID NO: 1 1] 
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HpCHFS GTTCACCTCGAGGATTCTATTGATCTGGAAGATATTACCGCTTGGGGATA 5 0 
VHLEDSIDLEDITAWGY 
-40 "24 
HpCHFS CCTCACCAAATTCGGTATTCCAGAAGCTGAGAAAATCCGCAACGCTGAAG 100 
LTKFGIPEAEKIRNAE 

-8 

HpCHFS AAGCTAGCTCTGCTAGCAGGATCGTCGGTGGT^A^GTCCAGTG2£|gA 150 

E A S S A S R a FTW' G^&^l \fS^m\^f^ 

-ll+i *" ~ +1° 

HpCHFS CAGATCCCTTACCAGGCT GGTGTCGTCATTGACTTAGCAGGTGG CCAGGC 200 
HpCHFS TGTCTGCGGAGGCTCCCTGATC AGCG CTTCCCGCGTACTGACCGCTGCTC 250 

^IIM^MMijMMM^ s r v l t a a 

HpCHFS ACTGCTGGTTCGACGGCCAAAACCAGGCCTGGAGATTCACCGTTGTTCTT 300 
HCWFDGQNQAWRFTVVL 
# 

HpCHFS GGTTCCACCACCTTGTTCTCTGGCGGTACCAGAATCCCTACATCCAATGT 350 
GSTTLFSGGTRI PTSNV 

HpCHFS TGTTATGCACGGAAGCTGGACTCCTAGCCTTATCCGTAACGATGTTGCCG 400 
VMHGSWT PSLIRNDVA 

# 

HpCHFS TAATCAGATTGGGCACCAACGTAGCAACCTCAAACACCATTGCCATCATC 450 
VIRLGTNVATSNT IAI I 

HpCHFS GCTCTACCCAGCGGCAGCCAGATCAACGAGAACTTCGCCGGTGAAACCGC 500 
ALPSGSQINENFAGETA 

HpCHFS CCTCGCCTCCGGCTTCGGTCTCACCAGTGACACCGGCAGCATCTCCAGCA 550 
LASGFGLTSDTGSISS 

HpCHFS ACCAGGCTCTGAGCCACGTCAACCTGCCAGTGATCACCAACGCTGTGTGC 600 
NQALSHVNLPVITNAVC 

HpCHFS AGAAATTCATTCCCCCTGCTGATCCAGGACTCTAACATTTGCACCAGCGG 650 
RNSFPLLIQDSNICTSG 

HpCHFS TGCCAACGGCAGGAGCACTTGCCGCGGTGACTCCGGCGGTCCTCTCGTCG 700 
ANGRSTCRGDSGGPLV 

§ # 
HpCHFS TCACCAGGAACA&CAGACCACTCTTGATCGGTATCACCTCTTTCGGATCT 750 
VTRNNRPLLIGITSFGS 

HpCHFS flCCnfiClGGTTRCCAAGTTGGATCT CCCGCTGCCTTCGCCAGAGT CACCTC 800 
ARGCQVGS PAAFARVTS 

HpCHFS TTACATCAGCTGGATCAACGGCCAGCTCTAAAATATCGAACATTTTGCCA 850 
YISWINGQL* 

HpCHFS TATCTACAGAGATATTTTGAAATACGTTAATTTAAATAAATATTTTATTT 900 
HpCHFS ATTCAAAAAAAAAAAAAAAAA 921 
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HpChlAI 

HpChlBI 

HpCh2A 

HpCh2B 

HpCh3 

HpCh4l 

HpCh5 

Trypsiniv 

HpChlAI 

HpChlBI 

HpCh2A 

HpCh2B 

HpCh3 

HpCh4l 

HpChS 

Trypsiniv 

HpChlAI 

HpChlBI 

HpCh2A 

HpCh2B 

HpCh3 

HpCh4l 

HpCh5 

Trypsiniv 

HpChlAI 

HpChlBI 

HpCh2A 

HpCh2B 

HpCh3 

HpCh4AI 

HpChS 

Trypsiniv 

HpChlAI 

HpChlBI 

HpCh2A 

HpCh2B 

HpCh3 

HpCh4l 

HpCh5 

Trypsiniv 

HpChlAI 

HpChlBI 

HpCh2A 

HpCh2B 

HpCh3 

HpCh4l 

HpChS 

Trypsiniv 



SPYQAGLjE 
PYQAGL 
PYQAGL 
PYQAGL 

PY^b 

mm 




STIFI-FfflSl 
S®TIFI-FJS 

iJASFA!S-gg 



rmvswigIyss 

$IDL&!g-§ 
SHNSGSH— — 



rRPE-ILMESTNKYI 

aS^-SBlHlSfJir™" 

- KTRIQVR|LiG EHf^I KJ^LE@N EQF^JjAA^ 




144 
144 
144 
144 
143 





— m 



ILIGVTSEjGj 
ILIGVTSgG 
R0ILIGVT5 5G 

r|iligvtseg 

RBI LIGVTS 
R 

RBiLIGgTS 



lARgHQVgS 

. bS^reSqw 

IDARGE^QSRYCELRIHCWNATAHj 

' qiHg aBf^gAR^QVH 

;G QlQSfiVHWSHGCAWWN- 




235 

235 

234 

234 

233 

NFDWDSDAIIP 256 

236 

NS 224 
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[SEQ ID NO: 71] DH04_H02 ARMIGERA 
[SEQ ID NO: 72] RECH1A PUNCTIGERA 

DH04_H02 ARMIGERA 
RECH1A PUNCTIGERA 

DH04_H02 ARMIGERA 
RECH1A PUNCTIGERA 

DH04_H02 ARMIGERA 
RECH1A PUNCTIGERA 

DH04_H02 ARMIGERA 
RECH1A PUNCTIGERA 

DH04_H02 ARMIGERA 
RECH1A PUNCTIGERA 

DH04_H02 ARMIGERA 
RECH1A PUNCTIGERA 



VHLEDSIDLE DITAWGYLTK FGIPEAEKIR NAEEASSASR 



IVGGSLSSLG QIPYQAGLVI DLSGGQAVCG GSLISASRVL 
A 

TAAHCWFDGQ NQAWRFTWL GSTTLFSGGT RIATSNVVMH 
P 

GSWTPSLIRN DVAVIRLGTN VGTSNTIAII ALPSGSQINE 



NFAGETALAS GFGLTSDSGS ISSNQALSHV NLPVITNAVC 
T 

RSSFPLLIQD SNICTSGANG RSTCRGDSGG PLWTRNSRP 
• N N.. 

LLIGITSFGS ARGCQVGSPA AFARVTSYIS WINGQ 



Figure 14 



WO 2004/094630 



PCT/AU2004/000524 



21/36 



BOV CHB 
BOV CHA 
HpCh2A 
Hpch5 



BOV CHB 
BOV CHA 
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HpCh5 



BOV CHB 
BOV CHA 
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HpCh5 



BOV CHB 
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Hpch2B 
HpCh5 
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Figure 17 
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Figure 18A 




Figure 18B 
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Figure 19A 
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10/55*237 



kDa 
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Figure 20A 
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coomassie 



a -CH antibody 



Figure 20B 



Silver stain 



a-ch antibody 
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Coomassie a-HIS a-RC a-SC 



kDa SC RC SC RC SC RC SC RC 
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Days after egg hatch 



Figure 23 
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35/36 mo®2&w. 

ggatccATGAAACTCTTGGCTGTGACTCTATTGGCTTTCGCCGCGGTCGTCTCCGCGAGG 60 
MKLLAVTLLAFAAVVSAR 

ggatccATGAAACTCTTGGCTGTGACTCTATTGGCtttcg 

FwBacRECH2 Primer > 

TTGGCTTTCGCCGCGGTCGTCTCCGCGAGG 
FwBacRECH2 Primer 

Aacgg [a] tcccaccatcaccatcaccatGTTCACCTCGAGGATTCTATTGATCTGGAAGAT 120 
NGSHHHHHHVHLEDSIDLED 

Aacgg \g\ tccc 
> 

ATTACCGCTTGGGGATACCTCACCAAATTCGGTATTCCAGAAGCTGAGAAAATCCGCAAC 180 
ITAWGYLTKFGI PBAEKI R N 

GCTGAAGAAGCTAGCTCTGCTAGCAGGATCGTCGGTGGTTCATTGTCCAGTGTCGGACAG 240 
AEEAS SASRIVGGSLSSVGQ 

ATCCCTTACCAGGCTGGTCTCGTCATTGACTTAGCAGGTGGCCAGGCTGTCTGCGGAGGC 300 
I PYQAGLVI DLAGGQAVCGG 

TCCCTGATCAGCGCTTCCCGCGTACTGACCGCTGCTCACTGCTGGTTCGACGGCCAAAAC 360 
SLI SASRVLTAAHCWFDGQN 

CAGGCCTGGAGATTCACCGTTGTTCTTGGTTCCACCACCTTGTTCTCTGGCGGTACCAGA 420 
QAWRFTVVLGSTTLFSGGTR 

ATCCCTACATCCAATGTTGTTATGCACGGAAGCTGGACTCCTAGCCTTATCCGTAACGAT 480 
IPTSNVVMHGSWTPSLIRND 

GTTGCCGTAATCAGATTGGGCACCAACGTAGCAACCTCAAACACCATTGCCATCATCGCT 540 
VAVIRLGTNVATSNTIAIIA 

CTACCCAGCGGCAGCCAGATCAACGAGAACTTCGCCGGTGAAACCGCCCTCGCCTCCGGC 600 
LPSGSQINENFAGETALASG 

TTCGGTCTCACCAGTGACACCGGCAGCATCTCCAGCAACCAGGCTCTGAGCCACGTCAAC 660 
FGLTSDTGSISSNQALSHVN 

CTGCCAGTGATCACCAACGCTGTGTGCAGAAATTCATTCCCCCTGCTGATCCAGGACTCT 720 
LPVITNAVCRNSFPLLIQDS 

AACATTTGCACCAGCGGTGCCAACGGCAGGAGCACTTGCCGCGGTGACTCCGGCGGTCCT 780 
NICTSGANGRSTCRGDSGGP 

CTCGTCGTCACCAGGAACAACAGACCACTCTTGATCGGTATCACCTCTTTCGGATCTGCC 840 
LVVTRNNRPLLIGITSFGSA 

CGCGGTTGCCAAGTTGGATCTCCCGCTGCCTTCGCCAGAGTCACCTCTTACATCAGCTG 900 
RGCQVGSPAAFARVTSYISW 

GATCAACGGCCAGCTCTAAaagctt 925 
INGQL*KL 

GATCAACGGCCAGCTCTAAaagctt 
< RvRECH Primer 



Figure 28 
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SEQUENCE LISTING 

<110> Hexima Limited 

La Trobe University 
Dunse, Kerry, Michelle (US ONLY) 
Heath, Robyn, Louise (US ONLY) 
Anderson, Marilyn, Anne (US ONLY) 

<120> Novel insect enzymes and inhibitors thereof 

<130> 12440340/EJH 

<150> US 60/465,054 
<151> 2003-04-23 

<160> 93 

<170> Patentln version 3.1 

<210> 1 

<211> 5 

<212> PRT 

<213> peptide 

<400> 1 



Glu Glu Lys Lys Asn 
1 5 
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-2- 

<210> 2 

<211> 219 

<212> PRT 

<213> Helicoverpa 

<400> 2 

lie Val Gly Gly Ser Leu Ser Ser Val Gly Gin lie Pro Tyr Gin Ala 
15 10 15 



Gly Leu Val He Asp Leu Ala Gly Gly Gin Ala Val Cys Gly Gly Ser 
20 25 30 



Leu He Ser Ala Ser Arg Val Leu Thr Ala Ala His Cys Trp Phe Asp 
35 40 45 



Gly Gin Asn Gin Ala Trp Arg Phe Thr Val Val Leu Val Met His Gly 
50 55 60 



Ser Trp Thr Pro Ser Leu He Arg Asn Asp Val Ala Val He Arg Leu 
65 70 75 80 



Gly Thr Asn Val Ala Thr Ser Asn Thr He Ala He He Ala Leu Pro 
85 90 95 



Ser Gly Ser Gin He Asn Glu Asn Phe Ala Gly Glu Thr Ala Leu Ala 
100 105 HO 



Ser Gly Phe Gly Leu Thr Ser Asp Thr Gly Ser He Ser Ser Asn Gin 
115 120 125 
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-3- 



Ala Leu Ser His Val Asn Leu Pro Val He Thr Asn Ala Val Cys Arg 
130 135 140 



Asn Ser Phe Pro Leu Leu He Gin Asp Ser Asn He Cys Thr Ser Gly 
145 150 155 160 



Ala Asn Gly Arg Ser Thr Cys Arg Gly Asp Ser Gly Gly Pro Leu Val 
165 170 175 



Val Thr Arg Asn Asn Arg Pro Leu Leu He Gly He Thr Ser Phe Gly 
180 185 190 



Ser Ala Arg Gly Cys Gin Val Gly Ser Pro Ala Ala Phe Ala Arg Val 
195 200 205 



Thr Ser Tyr He Ser Trp He Asn Gly Gin Leu 
210 215 



<210> 3 

<211> 40 

<212> PRT 

<213> Helicoverpa 

<400> 3 

Val His Leu Glu Asp Ser He Asp Leu Glu Asp He Thr Ala Trp Gly 
15 10 15 



Tyr Leu Thr Lys Phe Gly He Pro Glu Ala Glu Lys He Arg Asn Ala 
20 25 30 
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-4- 



Glu Glu Ala Ser Ser Ala Ser Arg 
35 40 

<210> 4 

<211> 708 

<212> DNA 

<213> Helicoverpa 

<400> 4 

atcgtcggtg gttcattgtc cagtgtcgga cagatccctt accaggctgg tctcgtcatt 60 

gacttagcag gtggccaggc tgtctgcgga ggctccctga tcagcgcttc ccgcgtactg 120 

accgctgctc actgctggtt cgacggccaa aaccaggcct ggagattcac cgttgttctt 180 

ggttccacca ccttgttctc tggcggtacc agaatcccta catccaatgt tgttatgcac 240 

ggaagctgga ctcctagcct tatccgtaac gatgttgccg taatcagatt gggcaccaac 300 

gtagcaacct caaacaccat tgccatcatc gctctaccca gcggcagcca gatcaacgag 360 

aacttcgccg gtgaaaccgc cctcgcctcc ggcttcggtc tcaccagtga caccggcagc 420 

atctccagca accaggctct gagccacgtc aacctgccag tgatcaccaa cgctgtgtgc 480 

agaaattcat tccccctgct gatccaggac tctaacattt gcaccagcgg tgccaacggc 540 

aggagcactt gccgcggtga ctccggcggt cctctcgtcg tcaccaggaa caacagacca 600 

ctcttgatcg gtatcacctc tttcggatct gcccgcggtt gccaagttgg atctcccgct 660 

gccttcgcca gagtcacctc ttacatcagc tggatcaacg gccagctc 708 
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<210> 5 

<211> 120 

<212> DNA 

<213> Helicoverpa 

<400> 5 

gttcacctcg aggattctat tgatctggaa gatattaccg cttggggata cctcaccaaa >60 
ttcggtattc cagaagctga gaaaatccgc aacgctgaag aagctagctc tgctagcagg 120 



<210> 6 

<211> 921 

<212> DNA 

<213> Helicoverpa 

<400> 6 

gttcacctcg aggattctat tgatctggaa gatattaccg cttggggata cctcaccaaa 60 

ttcggtattc cagaagctga gaaaatccgc aacgctgaag aagctagctc tgctagcagg 120 

atcgtcggtg gttcattgtc cagtgtcgga cagatccctt accaggctgg tctcgtcatt 180 

gacttagcag gtggccaggc tgtctgcgga ggctccctga tcagcgcttc ccgcgtactg 240 

accgctgctc actgctggtt cgacggccaa aaccaggcct ggagattcac cgttgttctt 300 

ggttccacca ccttgttctc tggcggtacc agaatcccta catccaatgt tgttatgcac 360 

ggaagctgga ctcctagcct tatccgtaac gatgttgccg taatcagatt gggcaccaac 420 

gtagcaacct caaacaccat tgccatcatc gctctaccca gcggcagcca gatcaacgag 480 

aacttcgccg gtgaaaccgc cctcgcctcc ggcttcggtc tcaccagtga caccggcagc 540 

atctccagca accaggctct gagccacgtc aacctgccag tgatcaccaa cgctgtgtgc 600 

agaaattcat tccccctgct gatccaggac tctaacattt gcaccagcgg tgccaacggc 660 
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aggagcactt gccgcggtga ctccggcggt cctctcgtcg tcaccaggaa caacagacca 720 

ctcttgatcg gtatcacctc tttcggatct gcccgcggtt gccaagttgg atctcccgct 780 

gccttcgcca gagtcacctc ttacatcagc tggatcaacg gccagctcta aaatatcgaa 840 

cattttgcca tatctacaga gatattttga aatacgttaa tttaaataaa tattttattt 900 

attcaaaaaa aaaaaaaaaa a 921 



<210> 7 

<211> 32 

<212> DNA 

<213> artificial sequence 
<220> 

<223> BamHI oligonucleotide primer 

<400> 7 

gaccagccgg atccgatcgg atatgcacca ac 

<210> 8 

<211> 34 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Hindlll oligonucleotide primer 



<400> 8 

ggagccaagc caagctttga acgcgggcaa actc 



34 
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<210> 9 
<211> 36 
<212> PRT 

<213> artificial sequence 
<220> 

<223> N-terminal sequence of resistant chymotrypsin 
<400> 9 

He Val Gly Gly Ser Leu Ser Ser Val Gly Gin He Pro Tyr Gin Ala 
15 10 15 



Gly Leu Val He Asp Leu Ala Gly Gly Gin Ala Val Cys Gly Gly Ser 
20 25 30 



Leu He Ser Ala 
35 



<210> 10 

<211> 29 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Fw2ResChy primer 

<400> 10 

tcagctgtag ctggagctca agatactcc 
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<210> 11 
<211> 35 
<212> DNA 

<213> artificial sequence 
<220> 

<223> FwResChym primer 
<400> 11 

gtagctatac tgactctagc tgcagctgga gctgg 

<210> 12 

<211> 21 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Hc35PQE-6-Fw primer 

<400> 12 

ttaaccatgg tgatcgacct c 

<210> 13 

<211> 23 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Hc35PQ-60-Rv primer 

<400> 13 

gatgagatct gagacgttgg ttg 
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<210> 14 

<211> 25 

<212> DNA 

<213> artificial sequence 
<220> 

<223> gene specific sense primer 

<400> 14 

cgggatccat ggagtcaaag tttgc 

<210> 15 

<211> 25 

<212> DNA 

<213> artificial sequence 
<220> 

<223> gene specific antisense primer 

<400> 15 

gcgtcgacgc ttaagccacc ctagg 

<210> 16 

<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> StPOTIA sense primer 



<400> 16 

cgggatccaa ggaatcggaa tctg 



24 
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<210> 17 
<211> 23 
<212> DNA 

<213> artificial sequence 
<220> 

<223> StPOTIB sense primer 
<400> 17 

cgggatccaa ggaatttgaa tgc 

<210> 18 

<211> 22 

<212> DNA 

<213> artificial sequence 
<220> 

<223> StPOTIA/B antisense primer 

<400> 18 

cgagctctta agccacccta gg 

<210> 19 

<211> 40 

<212> DNA 

<213> artificial sequence 
<220> 

<223> FWBacRECHI ( 5 1 -3 » ), primer 

<400> 19 

ttggctttcg ccgcggtcgt ctccgcgagg aacgggtccc 
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<210> 20 
<211> 40 
<212> DNA 

<213> artificial sequence 
<220> 

<223> FWBacRECH2 (5' -3 ? ) primer 
<400> 20 

ggatccatga aactcttggc tgtgactcta ttggctttcg 

<210> 21 

<211> 25 

<212> DNA 

<213> artificial sequence 
<220> 

<223> RvRECH (3' -5') primer 

<400> 21 

gatcaacggc cagctctaaa agctt 

<210> 22 

<211> 53 

<212> PRT 

<213> peptide 

<400> 22 

Met Ala Val His Arg Val Ser Phe Leu Ala Leu Leu Leu Leu Phe Gly 
15 10 15 



Met Ser Leu Leu Val Ser Asn Val Glu His Ala Asp Ala Lys Ala Cys 
20 25 30 
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Thr Leu Asn Cys Asp Pro Arg He Ala Tyr Gly Val Cys Pro Arg Ser 
35 40 45 

Glu Glu Lys Lys Asn 
50 



<210> 23 

<211> 58 

<212> PRT 

<213> peptide 

<400> 23 

Asp Arg He Cys Thr Asn Cys Cys Ala Gly Thr Lys Gly Cys Lys Tyr 
15 10 15 

Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Arg 
20 25 30 

Asn Pro Lys Ala Cys Thr Leu Asn Cys Asp Pro Arg He Ala Tyr Gly 
35 40 45 



Val Cys Pro Arg Ser Glu Glu Lys Lys Asn 
50 55 
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<210> 24 
<211> 58 
<212> PRT 
<213> peptide 

<400> 24 

Asp Arg He Cys Thr Asn Cys Cys Ala Gly Thr Lys Gly Cys Lys Tyr 
15 10 15 



Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Arg 
20 25 30 



Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Pro Arg He Ala Tyr Gly 
35 40 45 



He Cys Pro Leu Ala Glu Glu Lys Lys Asn 
50 55 



<210> 25 

<211> 58 

<212> PRT 

<213> peptide 

<400> 25 

Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr 
15 10 15 



Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Lys 
20 25 30 
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-14- 

Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Gly Arg He Ala Tyr Gly 
35 40 45 



He Cys Pro Leu Ser Glu Glu Lys Lys Asn 
50 55 



<210> 26 

<211> 58 

<212> PRT 

<213> peptide 

<400> 26 

Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr 
15 10 15 

Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Arg 
20 25 30 



Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Gly Arg He Ala Tyr Gly 
35 40 45 



He Cys Pro Leu Ser Glu Glu Lys Lys Asn 
50 55 
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<210> 27 
<211> 54 
<212> PRT 
<213> peptide 

<400> 27 

Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr 
15 10 15 

Phe Ser Asp Asp Gly Thr Phe He Cys Glu Gly Glu Ser Glu Tyr Ala 
20 25 30 



Ser Lys Val Asp Glu Tyr Val Gly Glu Val Glu Asn Asp Leu Gin Lys 
35 40 45 



Ser Lys Val Ala Val Ser 
50 



<210> 28 

<211> 36 

<212> PRT 

<213> Helicoverpa 

<400> t 28 

He Val Gly Gly Ser Leu Ser Ser Val Gly Gin He Pro Tyr Gin Ala 
15 10 15 



Gly Leu Val He Asp Leu Ala Gly Gly Gin Ala Val Cys Gly Gly Ser 
20 25 30 
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-16- 

Leu lie Ser Ala 
35 



<210> 29 

<211> 29 

<212> PRT 

<213> Helicoverpa 

<400> 29 

He Val Gly Gly Ser He Ser Ser He Gly Gin He Pro Tyr Gly Ala 
15 10 15 



Gly Leu Val He Asp Phe Ala Gly Gly Gin Ala Val Cys 
20 25 



<210> 30 

<211> 29 

<212> PRT 

<213> Helicoverpa 

<400> 30 

He Val Gly Gly Ser Thr Ser Ser Val Gly Gin Phe Pro Tyr Gin Ala 
15 10 15 



Gly Leu Leu Ala Ser Phe Ala Gly Gly Gin Ala Val Cys 
20 25 
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<210> 31 

<211> 29 

<212> PRT 

<213> Helicoverpa 

<400> 31 

He Val Gly Gly Ser Val Thr Thr Leu Asp Ala Tyr Pro Thr He Ala 
15 10 15 

Gly Leu Val Tyr Asn Phe Ala Gly Gly Gin Ala Val Cys 
20 25 



<210> 32 
<211> 295 
<212> PRT 

<213> H. punctigera 
<400> 32 

Met Lys Leu Leu Ala Val Thr Leu Leu Ala Phe Ala Ala Val Val Ser 
15 10 15 

Ala Arg Asn He Asp Leu Glu Asp Val He Asp Leu Glu Asp He Thr 
20 25 30 

Ala Tyr Asp Tyr His Thr Lys He Gly He Pro Leu Ala Glu Lys He 
35 40 45 



Arg Ala Ala Glu Glu Glu Ala Glu Arg Asn Pro Ser Arg He Val Gly 
50 55 60 
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Gly Ser Thr Ser Ser Leu Gly Ala Phe Pro Tyr Gin Ala Gly Leu Leu 
65 70 75 80 



Ala Thr Phe Ala Ser Gly Gin Gly Val Cys Gly Gly Ser Leu Leu Asn 
85 90 95 



Asn Arg Arg Val Leu Thr Ala Ala His Cys Trp Phe Asp Gly Arg Asn 
100 105 110 



Gin Ala Arg Ser Phe Thr Val Val Leu Gly Ser Val Arg Leu Phe Ser 
115 120 125 



Gly Gly Thr Arg Leu Asn Thr Ala Ser Val Val Met His Gly Ser Trp 
130 135 140 



Asn Pro Asn Leu He Arg Asn Asp He Ala Met He Asn Leu Pro Ser 
145 150 155 160 



Asn Val Ala Thr Ser Gly Asn He Ala Pro He Ala Leu Pro Ser Gly 
165 170 175 



Asn Glu Leu Asn Asn Asn Phe Asn Gly Ala Thr Ala Val Ala Ser Gly 
180 185 190 



Phe Gly Leu Ala Arg Asp Gly Gly Ser Val Asp Gly Asn Leu Arg His 
195 200 205 



Val Asn Leu Pro Val He Thr Asn Ala Val Cys Thr Val Ser Phe Pro 
210 215 220 
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Gly lie He Gin Ser Ser Asn He Cys Thr Ser Gly Ala Asn Gly Arg 
225 230 235 240 



Ser Thr Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Val Thr Ser Asn 
245 250 255 



Asn Arg Arg He Leu He Gly Val Thr Ser Phe Gly Ser Ala Arg Gly 
260 265 270 



Cys Gin Val Gly Ser Pro Ala Ala Phe Ala Arg Val Thr Ser Phe He 
275 280 285 



Ser Trp He Asn Gin Arg Leu 
290 295 



<210> 33 
<211> 292 
<212> PRT 

<213> H. punctigera 
<400> 33 

Leu Ala Val Thr Leu Leu Ala Phe Ala Ala Val Val Ser Ala Arg Asn 
15 10 15 



He Asp Leu Glu Asp Val He Asp Leu Glu Asp He Thr Ala Tyr Asp 
20 25 30 



Tyr His Thr Lys He Gly He Pro Leu Ala Glu Lys He Arg Ala Ala 
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20 



35 



40 



45 



Glu Glu Glu Ala Glu Arg Asn Pro Ser Arg lie Val Gly Gly Ser Thr 
50 55 60 



Ser Ser Leu Gly Ala Phe Pro Tyr Gin Ala Gly Leu Leu Ala Thr Phe 
65 70 75 80 



Ala Ser Gly Gin Gly Val Cys Gly Gly Ser Leu Leu Asn Asn Arg Arg 
85 90 95 



Val Leu Thr Ala Ala His Cys Trp Phe Asp Gly Arg Asn Gin Ala Arg 
100 105 110 



Ser Phe Thr Val Val Leu Gly Ser Val Arg Leu Phe Ser Gly Gly Thr 
115 120 125 



Arg Leu Asn Thr Ala Ser Val Val Met His Gly Ser Trp Asn Pro Asn 
130 135 140 



Leu He Arg Asn Asp He Ala Met He Asn Leu Pro Ser Asn Val Ala 
145 150 155 160 



Thr Ser Gly Asn He Ala Pro He Ala Leu Pro Ser Gly Asn Glu Leu 
165 170 175 



Asn Asn Asn Phe Asn Gly Ala Thr Ala Val Ala Ser Gly Phe Gly Leu 
180 185 190 
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Ala Arg Asp Gly Gly Ser Val Asp Gly Asn Leu Arg His Val Asn Leu 
195 200 205 



Pro Val He Thr Asn Ala Val Cys Thr Val Ser Phe Pro Gly He He 
210 215 220 



Gin Ser Ser Asn He Cys Thr Ser Gly Ala Asn Gly Arg Gly Thr Cys 
225 230 235 240 



Gin Gly Asp Ser Gly Gly Pro Leu Val Val Thr Ser Asn Asn Arg Arg 
245 250 255 



He Leu He Gly Val Thr Pro Phe Gly Ser Ala Arg Gly Cys Gin Val 
260 265 270 



Gly Ser Pro Ala Ala Phe Ala Arg Val Thr Ser Phe He Ser Trp He 
275 280 285 



Asn Gin Arg Leu 
290 
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<210> 34 
<211> 295 
<212> PRT 

<213> H, punctigera 
<400> 34 

Met Lys Leu Leu Ala Val Thr Leu Leu Ala Phe Ala Ala lie Val Ser 
15 10 15 



Ala Arg Asn lie Asp Leu Glu Asp Val lie Asp Leu Glu Asp He Thr 
20 25 30 



Ala Tyr Asp Tyr His Thr Lys He Gly He Pro Leu Ala Glu Lys He 
35 40 45 



Arg Ala Ala Glu Glu Glu Ala Glu Arg Asn Pro Ser Arg He Val Gly 
50 55 60 



Gly Ser He Ser Ser Leu Gly Ala Phe Pro Tyr Gin Ala Gly Leu Leu 
65 70 75 80 



Ala Thr Phe Ala Ser Gly Gin Gly Val Cys Gly Gly Ser Leu Leu Asn 
85 90 95 



Asn Arg Arg Val Leu Thr Ala Ala His Cys Trp Phe Asp Gly Arg Asn 
100 105 110 



Gin Ala Arg Ser Phe Thr Val Val Leu Gly Ser Val Arg Leu Phe Ser 
115 120 125 
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Gly Gly Thr Arg Leu Asn Thr Ala Ser Val Val Met His Gly Ser Trp 
130 135 140 



Asn Pro Asn Leu He Arg Asn Asp He Ala lie He Asn Leu Pro Ser 
145 150 155 160 



Asn Val Ala Thr Ser Gly Asn He Ala Pro He Ala Leu Pro Ser Gly 
165 170 175 



Asn Glu Leu Asn Asn Asn Phe Asn Gly Ala Thr Ala Val Ala Ser Gly 
180 185 190 



Phe Gly Leu Ala Asn Asp Gly Gly Ser Val Asp Gly Asn Leu Arg His 
195 200 205 



Val Asn Leu Pro Val He Thr Asn Ala Val Cys Thr Val Ser Phe Pro 
210 215 220 



Gly He He Gin Ser Ser Asn He Cys Thr Ser Gly Ala Asn Gly Arg 
225 230 235 240 



Ser Thr Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Val Thr Ser Asn 
245 250 255 



Asn Arg Arg He Leu He Gly Val Thr Ser Phe Gly Ser Ala Arg Gly 
260 265 270 
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Cys Gin Val Gly Ser Pro Ala Ala Phe Ala Arg Val Thr Ser Phe He 
275 280 285 



Ser Trp He Asn Asn Leu Leu 
290 295 



<210> 35 
<211> 276 
<212> PRT 

<213> H. punctigera 
<400> 35 

He Asn His Glu Ala Val Val Asp Leu Glu Asp He Thr Ala Tyr Gly 
15 10 15 



Tyr His Thr Lys Val Gly He Pro Leu Ala Glu Glu He Arg He Ala 
20 25 30 



Glu Leu Glu Ala Ser Arg Asn Pro Ser Arg He Val Gly Gly Ser Ser 
35 40 45 



Ala Ser Leu Gly Gin Phe Pro Tyr Gin Ala Gly Leu Leu He Asn Leu 
50 55 60 



Pro Leu Gly Gin Ser Val Cys Gly Gly Ser Leu Leu Asn Gin Arg Arg 
65 70 75 80 



Val Leu Thr Ala Ala His Cys Trp Phe Asp Gly Arg Asn Gin Ala Asn 
85 90 95 
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Ser Leu Thr Val He Leu Gly Ser He Asn Leu Tyr Phe Gly Gly Thr 
100 105 HO 



Arg Leu Asn Ser Asn Ser Val Val Met His Gly Ser Trp Asn Pro Asn 
115 120 125 



Leu He Arg Asn Asp He Ala He He Asn Leu Pro Ser Asn Val Gly 
130 135 140 



Thr Ser Asn Asn He Ala Pro He Ala Leu Pro Ser Gly Asn Glu Leu 
145 150 155 160 



Asn Asn Gin Phe Ala Gly Phe Thr Ala Thr Ala Ser Gly Phe Gly Arg 
165 170 175 



Thr Arg Asp Gly Gly Ser Val Ser Pro Thr Leu Asn His Val Asn Leu 
180 185 190 



Pro Val He Thr Asn Asn Val Cys Trp Gin Ser Phe Pro Leu Tyr He 
195 200 205 



Gin Ser Ser Asn He Cys Thr Ser Gly Ala Asn Gly Arg Ser Thr Cys 
210 215 220 



Gin Gly Asp Ser Gly Gly Pro Leu Val Val Thr Ser Asn Asn Arg Arg 
225 230 235 240 
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Ile Leu He Gly Val Thr Ser Phe Gly Ser Asp Arg Gly Cys Gin Val 
245 250 255 



Gly Ala Pro Ala Ala Phe Ala Arg Val Thr Ser Tyr He Ser Trp He 
260 265 270 



Asn Gin Arg Leu 
275 



<210> 36 
<211> 292 
<212> PRT 

<213> H. punctigera 
<400> 36 

Met Lys Leu Phe Leu Gly Val Cys Leu Thr Leu Ala Val Ala Val Ser 
15 10 15 



Ala Val Glu He Ala Thr Pro Asp Ala Asp Ser Pro Val Phe Gly Tyr 
20 25 30 

His Ala Lys Phe Gly He Ala Glu Ala Ala Arg He Lys Ser Ala Glu 
35 40 45 



Glu Val Gin Ser Phe Asn Gly Gin Arg He Val Gly Gly Ser He Thr 
50 55 60 



Asn He Ala Asn Val Pro Tyr Gin Ala Gly Leu Val He Thr He Phe 
65 70 75 80 
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He Phe Gin Ser Val Cys Gly Ala Ser Leu He Ser His Asn Arg Leu 
85 90 95 



Val Thr Ala Ala His Cys Lys Ser Asp Gly Val Leu Thr Ala Asn Ser 
100 105 HO 



Phe Thr Val Val Leu Gly Ser Asn Thr Leu Phe Phe Gly Gly Thr Arg 
115 120 125 



He Asn Thr Asn Asp Val Val Met His Pro Asn Trp Asn Pro Asn Thr 
130 135 140 



Ala Ala Asn Asp He Ala Val Leu Arg He Ser Ser Val Ser Phe Ser 
145 150 155 160 



Asn Val He Gin Pro He Ala Leu Pro Ser Gly Asp Glu Leu Asn Asn 
165 170 175 



Leu Phe Val Gly Ala Asn Ala Leu Ala Ser Gly Phe Gly Arg Thr Ser 
180 185 190 



Asp Ser Gly Ser He Gly Thr Asn Gin Gin Leu Ser Ser Val Thr He 
195 200 205 



Pro Val He Thr Asn Ala Gin Cys Ala Ala Val Tyr Gly Ser Gly Phe 
210 215 220 
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Val His Ala Ser Asn lie Cys Thr Ser Gly Ala Gly Gly Lys Gly Thr 
225 230 235 240 



Cys Asn Gly Asp Ser Gly Gly Pro Leu Ala Val Asp Ser Asn Asn Arg 
245 250 255 



Lys He Leu He Gly Val Thr Ser Tyr Gly Ala Gin Ala Gly Cys Ala 
260 265 270 



Ala Gly Phe Pro Ala Ala Phe Ala Arg Val Thr Ser Phe Val Asp Trp 
275 280 285 



Val Gin Ser Gin 
290 



<210> 37 
<211> 257 
<212> PRT 

<213> H. punctigera 
<400> 37 

His Asn Lys Trp Val Leu Thr Ala Ala His Cys Leu Ala Asn Arg He 
15 10 15 



Thr Phe Val Val Arg Phe Gly Leu Thr Asn Leu Thr Arg Pro Glu He 
20 25 30 



Leu Val Glu Ser Ala Asn Lys Tyr He His Pro Asp Tyr Asp Glu He 
35 40 45 
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Arg Ala Gly Val Gin Thr Ala Asp Leu Ala Leu Val Gly Leu Asp His 
50 55 60 



His lie Glu Tyr Ser Ala Asn Val Gin Pro Ser Arg Leu Met Ser Ser 
65 70 75 80 



Ala Gin Lys Asn lie Asn Tyr Glu Gly He Gin Met He Val Ser Gly 
85 90 95 



Phe Gly Arg Thr Asp Asp Leu Trp Asn Gly Gly Ala Ala Ser Glu He 
100 105 HO 



Leu Leu Trp Val Tyr Gin Arg Gly Val Ser Asn Glu Glu Cys Leu Arg 
115 120 125 



Trp Tyr Pro Thr Ser Gin Val He Lys Glu Glu Thr He Cys Ala Gly 
130 135 140 



Tyr Trp Asp Asn Pro Ser Gin Ser Ser Cys Gin Gly Asp Ser Gly Gly 
145 150 155 160 



Pro Leu Thr He He Asp Ala Asp Gly Glu Arg Thr Gin Val Gly He 
165 170 175 



Val Ser Phe Gly Ser Thr Ala Gly Cys Asn Ser Pro Phe Pro Ser Gly 
180 185 190 
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Tyr Val Arg Pro Gly His Tyr His Asp Trp Phe Thr Glu Val Thr Gly 
195 200 205 

He Asn Phe Asp Trp Asp Ser Asp Ala He He Pro Gly Ser Ser Glu 
210 215 220 

Ser Glu Glu Asp Gly Ser Asn Pro Ser Ser Glu Glu Asp Ala Gly Ser 
225 230 235 240 

Pro Pro Ser Glu Glu Glu Glu Ala Pro Glu Lys Val Arg Val Val Glu 
245 250 255 



Tyr 



<210> 38 

<211> 19 

<212> DNA 

<213> artificial sequence 
<220> 

<223> FWG1 primer 

<400> 38 

tcccttacca ggcgctgtc 
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<210> 39 
<211> 18 
<212> DNA 

<213> artificial sequence 
<220> 

<223> RVG4 primer 
<400> 39 

tctggcgaag gcagcagg 

<210> 40 

<211> 20 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Y79Fw primer 

<400> 40 

ctgctagcct cggacaattc 

<210> 41 

<211> 20 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Y72Fw primer 

<400> 41 

ctggagtgca gactgctgac 
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<210> 42 
<211> 20 
<212> DNA 

<213> artificial sequence 
<220> 

<223> Y72Rv primer 
<400> 42 

ggatgatggc gtcgctgtcc 

<210> 43 
<211> 213 
<212> PRT 

<213> H. punctigera 
<400> 43 

Pro Tyr Gin Ala Gly Leu Val lie Thr He Phe He Phe Gin Ser Val 
15 10 15 

Cys Gly Ala Ser Leu He Pro His Asn Arg Leu Val Thr Ala Ala His 
20 25 30 



Cys Lys Ser Asp Gly Val Leu Thr Ala Asn Ser Phe Thr Val Val Leu 
35 40 45 



Gly Ser Asn Thr Leu Phe Phe Gly Gly Thr Arg He Asn Thr Asn Asp 
50 55 60 



Val Val Met His Pro Asn Trp Asn Pro Ser Thr Ala Ala Asn Asp He 
65 70 75 80 
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Ala Val Met Arg He Ser Ser Val Ser Phe Ser Asn Val He Gin Pro 
85 90 95 



He Ala Leu Pro Ser Gly Asp Glu Leu Asn Asn Leu Phe Val Gly Ala 
100 105 HO 



Asn Ala Leu Ala Ser Gly Phe Gly Arg Thr Ser Asp Gly Gly Ser He 
115 120 125 



Gly Ser Asn Gin Gin Val Ser Ser Val Thr He Pro Val He Thr Asn 
130 135 140 



Asp Glu Cys Ala Ala Val Tyr Gly Ser Ala Phe Val His Ser Ser Asn 
145 150 155 160 



He Cys Thr Ser Gly Ala Gly Gly Lys Gly Thr Cys Asn Gly Asp Ser 
165 170 175 



Gly Gly Pro Leu Ala He Asp Ser Asn Asn Glu Lys He Leu lie Gly 
180 185 190 



Val Thr Ser Tyr Gly Ala Gin Ala Gly Cys Ala Ala Gly Leu Pro Ala 
195 200 205 



Ala Phe Ala Arg Lys 
210 
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<210> 44 

<211> 213 

<212> PRT 

<213> H. punctigera 

<400> 44 

Pro Tyr Gin Ala Gly Leu Val He Thr He Phe He Phe Gin Ser Val 
15 10 15 



Cys Gly Ala Ser Leu He Ser His Asn Arg Leu Val Thr Ala Ala His 
20 25 30 



Cys Lys Phe Asp Gly Val Met Thr Ala Asn Ser Phe Thr Val Val Leu 
35 40 45 



Gly Ser Asn Thr Leu Phe Phe Gly Gly Thr Arg He Asn Thr Asn Asp 
50 55 60 



Val Val Met His Pro Asn Trp Asn Pro Ser Thr Val Ala Asn Asp He 
65 70 75 80 



Ala Val He Arg He Ser Ser He Val Phe Asn Asn Val He Gin Pro 
85 90 95 



He Ala Leu Pro Ser Gly Asp Glu Leu Asn Asn Leu Phe Val Gly Ala 
100 105 HO 



Asn Ala Leu Ala Ser Gly Phe Gly Arg Thr Ser Asp Ser Gly Gly He 
115 120 125 
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Gly Thr Asn Gin Gin Leu Ser Ser Val Thr He Pro Val He Thr Asn 
130 135 140 



Ala Glu Cys Ala Ala Val Tyr Gly Pro Ala Phe Val His Asp Thr Asn 
145 150 155 160 



He Cys Thr Ser Gly Ala Gly Gly Lys Gly Thr Cys Asn Gly Asp Ser 
165 170 175 



Gly Gly Pro Leu Ala Val Asp Ser Asn Asp Lys Lys He Leu He Gly 
180 185 190 



Val Thr Ser Tyr Gly Ala Ala Asp Gly Cys Ala Ala Gly Phe Pro Ala 
195 200 205 



Ala Ser Pro Glu Arg 
210 



<210> 45 
<211> 177 
<212> PRT 

<213> H. punctigera 
<400> 45 

Pro Tyr Gin Ala Gly Leu Leu Ala Asn Phe Ala Ser Gly Gin Gly Val 
15 10 15 



Cys Gly Gly Ser Leu Leu Asn Gin Arg Arg Val Leu Thr Ala Ala His 
20 25 30 
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Cys Trp Phe Asp Gly Arg Asn Gin Ala Arg Ser Phe Thr Val Val Leu 
35 40 45 



Gly Ser Val Arg Leu Phe Ser Gly Gly Thr Arg Leu Asp Thr Ala Ser 
50 55 60 



Val Val Met His Gly Ser Trp Asn Pro Asn Leu lie Arg Asn Asp He 
65 70 75 80 



Ala Met He Asn Leu Pro Ser Asn Val Ala Thr Ser Gly Asn He Ala 
85 90 95 



Pro He Ala Leu Pro Ser Gly Asn Glu Leu Asn Asn Asn Phe Asn Gly 
100 105 110 



Ala Thr Ala Thr Ala Ser Gly Phe Gly Leu Ala Arg Asp Gly Gly Ser 
115 120 125 



Val Asp Gly Asn Leu Arg His Val Asn Leu Pro Val He Thr Asn Ala 
130 135 140 



Val Cys Thr Val Ser Phe Pro Gly He He Gin Ser Ser Asn He Cys 
145 150 155 160 
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Thr Ser Gly Ala Asn Gly Arg Ser Thr Cys Gin Gly Asp Ser Gly Gly 
165 170 175 



Pro 



<210> 46 
<211> 217 
<212> PRT 

<213> H. punctigera 
<400> 46 

Ser Ala Ser Leu Gly Gin Phe Pro Tyr Gin Ala Gly Leu Leu He Asn 
15 10 15 



Leu Pro Leu Gly Gin Ser Val Cys Gly Gly Ser Leu Leu Asn Gin Arg 
20 25 30 



Arg Val Leu Thr Ala Ala His Cys Trp Phe Asp Gly Arg Asn Gin Ala 
35 40 45 



Thr Ser Leu Thr Val He Leu Gly Ser He Asn Leu Phe Phe Gly Gly 
50 55 60 



Thr Arg Leu Asn Ser Asn Ser Val Val Met His Gly Ser Trp Asn Pro 
65 70 75 80 



Asn Leu He Arg Asn Asp He Ala He He Asn Leu Pro Ser Asn Val 
85 90 95 
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Gly Thr Ser Gly Asn He Ala Pro He Ala Leu Pro Ser Gly Asn Glu 
100 105 HO 



Leu Asn Asn Gin Phe Ala Gly Phe Thr Ala Thr Ala Ser Gly Phe Gly 
115 120 125 



Leu Thr Arg Asp Gly Gly Asn Val Ser Pro Thr Leu Asn His Val Asn 
130 135 140 



Leu Pro Val He Thr Asn Asn Val Cys Trp Gin Ser Phe Pro Leu Tyr 
145 150 155 160 



He Gin Ser Thr Asn He Cys Thr Ser Gly Ala Asn Gly Arg Gly Thr 
165 170 175 



Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Val Thr Ser Asn Asn Arg 
180 185 190 



Arg He Leu He Gly Val Thr Ser Phe Gly Ser Asp Arg Gly Cys Gin 
195 200 205 



Val Gly Ala Pro Ala Ala Phe Ala Arg 
210 215 
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<210> 47 
<211> 170 
<212> PRT 

<213> H. punctigera 
<400> 47 

Ser Gly Val Gin Thr Ala Asp Leu Ala Leu Val Gly Leu Asp Gin Glu 
1 5 10 15 



He Glu Tyr Ser Ala Asn Val Gin Pro Ser Arg Leu Met Ser Ser Ala 
. 20 25 30 



Gin Lys Asn He Asn Tyr Glu Gly He Gin Met He Val Ser Gly Phe 
35 40 45 



Gly Arg Thr Asp Asp Leu Trp Asn Gly Gly Ala Ala Ser Glu He Leu 
50 55 60 



Leu Trp Val Tyr Gin Arg Gly Val Ser Asn Glu Glu Cys Leu Arg Trp 
65 70 75 80 



Tyr Pro Thr Ser Gin Val He Lys Glu Gin Thr He Cys Ala Gly Tyr 
85 90 95 



Trp Asp Asn Pro Ser Gin Ser Ser Cys Gin Gly Asp Ser Gly Gly Pro 
100 105 HO 



Leu Thr He He Asp Ala Asp Gly Glu Arg Thr Gin Val Gly He Val 
115 120 125 
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Ser Phe Gly Ser Thr Ala Gly Cys Asn Ser Pro Phe Pro Ser Gly Tyr 
130 135 140 



Val Arg Pro Gly His Tyr His Asp Trp Phe Thr Glu Val Thr Gly He 
145 150 155 160 



Asn Phe Asp Trp Asp Ser Asp Ala He He 
165 170 



<210> 48 

<211> 279 

<212> PRT 

<213> H. punctigera 

<400> 48 

Ala Val Ser Ala Val Glu He Gly Thr Pro Asp Ala Asp Ser Pro Val 
1 5 10 15 



Phe Gly Tyr His Ala Lys Phe Gly He Pro Glu Ala Ala Arg He Lys 
20 25 30 



Ser Ala Glu Glu Val Gin Ser Phe Asn Gly Gin Arg He Val Gly Gly 
35 40 45 



Ser He Thr Asp He Ala Asn Val Pro Tyr Gin Ala Gly Leu Val He 
50 55 60 



Thr He Phe He Phe Gin Ser Val Cys Gly Ala Ser Leu He Ser His 
65 70 75 80 
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Asn Arg Leu Val Thr Ala Ala His Cys Lys Ser Asp Gly Val Leu Thr 
85 90 95 



Ala Asn Ser Phe Thr Val Val Leu Gly Ser Asn Thr Leu Phe Phe Gly 
100 105 110 



Gly Thr Arg He Asn Thr Asn Asp Val Val Met His Pro Asn Trp Asn 
115 120 125 



Pro Ser Thr Ala Ala Asn Asp He Ala Val Met Arg He Ser Ser Val 
130 135 140 



Ser Phe Ser Asn Val He Gin Pro He Ala Leu Pro Ser Gly Asp Glu 
145 150 155 160 



Leu Asn Asn Leu Phe Val Gly Ala Asn Ala Leu Ala Ser Gly Phe Gly 
165 170 175 



Arg Thr Ser Asp Gly Gly Ser He Gly Ser Asn Gin Gin Val Ser Ser 
180 185 190 



Val Thr He Pro Val He Thr Asn Asp Glu Cys Ala Ala Val Tyr Gly 
195 200 205 



Ser Ala Phe Val His Ser Ser Asn He Cys Thr Ser Gly Ala Gly Gly 
210 215 220 
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Lys Gly Thr Cys Asn Gly Asp Ser Gly Gly Pro Leu Ala Val Asp Ser 
225 230 235 240 



Asn Asn Glu Lys He Leu He Gly Val Thr Ser Tyr Gly Ala Gin Ala 
245 250 255 



Gly Cys Ala Val Gly Leu Pro Ala Ala Phe Ala Arg Val Thr Ser Phe 
260 265 270 



Val Ser Trp Val Gin Ser Gin 
275 



<210> 49 

<211> 292 

<212> PRT 

<213> H. punctigera 

<400> 49 

Met Lys Leu Phe Leu Gly Val Cys Leu Ala Leu Ala Val Ala Val Ser 
15 10 15 



Ala Val Glu He Gly Thr Pro Glu Ala Gly Ser Pro Val Phe Gly Tyr 
20 25 30 



His Ala Lys Phe Gly He Ala Glu Ala Ala Arg He Lys Ser Ala Glu 
35 40 45 



Glu Val Gin Ser Phe Asn Gly Gin Arg He Val Gly Gly Ser He Thr 
50 55 60 
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Asn He Ala Asn Val Pro Tyr Gin Ala Gly Leu Val He Thr He Phe 
65 70 75 80 



He Phe Gin Ser Val Cys Gly Ala Ser Leu He Ser His Asn Arg Leu 
85 90 95 



Val Thr Ala Ala His Cys Lys Phe Asp Gly Val Met Thr Ala Asn Ser 
100 105 HO 



Phe Thr Val Val Leu Gly Ser Asn Thr Leu Phe Phe Gly Gly Thr Arg 
115 120 125 



He Asn Thr Asn Asp Val Val Met His Pro Asn Trp Asn Pro Ser Thr 
130 135 140 



Val Ala Asn Asp He Ala Val lie Arg He Ser Ser He Val Tyr Asn 
145 150 155 160 



Asn Val He Gin Pro He Ala Leu Pro Ser Gly Asp Glu Leu Asp Asn 
165 170 175 



Leu Phe Val Gly Ala Asn Ala Leu Ala Ser Gly Phe Gly Arg Thr Ser 
180 185 190 



Asp Ser Gly Gly He Gly Thr Asn Gin Gin Leu Ser Ser Val Thr He 
195 200 205 
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Pro Val He Thr Asn Ala Glu Cys Ala Ala Val Tyr Gly Pro Ala Phe 
210 215 220 



Val His Asp Thr Asn He Cys Thr Ser Gly Ala Gly Gly Lys Gly Thr 
225 230 235 240 



Cys Asn Gly Asp Ser Gly Gly Pro Leu Ala Val Asp Ser Asn Asp Lys 
245 250 255 



Lys He Leu He Gly Val Thr Ser Tyr Gly Ala Ala Asp Gly Cys Ala 
260 265 270 



Ala Gly Phe Pro Ala Ala Phe Ala Arg Val Thr Ser Phe Val Ser Trp 
275 280 285 



Val Gin Ser Gin 
290 



<210> 50 
<211> 295 
<212> PRT 

<213> H. punctigera 
<400> 50 

Met Lys Leu Leu Ala Val Thr Leu Leu Ala Phe Ala Ala Val Val Ser 
15 10 15 



Ala Arg Asn He Asp Leu Glu Asp Val He Asp Leu Glu Asp He Thr 
20 25 30 
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Ala Tyr Asp Tyr His Thr Lys lie Gly lie Pro Leu Ala Glu Glu lie 
35 40 45 



Arg Ala Ala Glu Glu Glu Ala Glu Arg Asp Pro Ser Arg lie Val Gly 
50 55 60 



Gly Ser Thr Ser Ser Leu Gly Ala Phe Pro Tyr Gin Ala Gly Leu Leu 
65 70 75 80 



Ala Asn Phe Ala Ser Gly Gin Gly Val Cys Gly Gly Ser Leu Leu Asn 
85 90 95 



Gin Arg Arg Val Leu Thr Ala Ala His Cys Trp Phe Asp Gly Arg Asn 
100 105 110 



Gin Ala Arg Ser Phe Thr Val Val Leu Gly Ser Val Arg Leu Phe Ser 
115 120 125 



Gly Gly Thr Arg Leu Asp Thr Ala Ser Val Val Met His Gly Ser Trp 
130 135 140 



Asn Pro Asn Leu He Arg Asn Asp He Ala Met He Asn Leu Pro Ser 
145 150 155 160 



Asn Val Ala Thr Ser Gly Asn He Ala Pro He Ala Leu Pro Ser Gly 
165 170 175 
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Asn Glu Leu Asn Asn Asn Phe Asn Gly Ala Thr Ala Thr Ala Ser Gly 
180 185 190 



Phe Gly Leu Ala Arg Asp Gly Gly Ser Val Asp Gly Asn Leu Arg His 
195 200 205 



Val Asn Leu Pro Val He Thr Asn Ala Val Cys Thr Val Ser Phe Pro 
210 215 220 



Gly He He Gin Ser Ser Asn He Cys Thr Ser Gly Ala Asn Gly Arg 
225 230 235 240 



Ser Thr Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Val Asn Ser Asn 
245 250 255 



Asn Arg Arg He Leu He Gly Val Thr Ser Phe Gly Ser Ala Arg Gly 
260 265 270 



Cys Gin Val Gly Ser Pro Ala Ala Phe Ala Arg Val Thr Ser Phe He 
275 280 285 



Ser Trp He Asn Gin Arg Leu 
290 295 
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<210> 51 
<211> 295 
<212> PRT 

<213> H. punctigera 
<400> 51 

Met Lys Leu Leu Ala Val Thr Leu Leu Ala Phe Ala Ala Val Val Ser 
15 10 15 



Ala Arg Asn He Asp Leu Glu Asp Val He Asp Leu Glu Asp He Thr 
20 25 30 



Ala Tyr Asp Tyr His Thr Lys He Gly He Pro Leu Ala Glu Lys He 
35 40 45 



Arg Ala Ala Glu Glu Glu Ala Glu Arg Asn Pro Ser Arg He Val Gly 
50 55 60 



Gly Ser Thr Ser Ser Leu Gly Ala Phe Pro Tyr Gin Ala Gly Leu Leu 
65 70 75 80 



Ala Ser Phe Ala Ser Gly Gin Gly Val Cys Gly Gly Ser Leu Leu Asn 
85 90 95 



Val Arg Arg Val Leu Thr Ala Ala His Cys Trp Phe Asp Gly Arg Asn 
100 105 HO 



Gin Ala Arg Ser Phe Thr Val Val Leu Gly Ser Val Arg Leu Tyr Ser 
115 120 125 
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Gly Gly Thr Arg Leu Asn Thr Ala Ser Val Val Met His Gly Ser Trp 
130 135 140 



Asn Pro Asn Leu Val Arg Asn Asp lie Ala Met He Asn Leu Pro Ser 
145 150 155 160 



Asn Val Ala Thr Ser Gly Asn He Ala Pro He Ala Leu Pro Ser Gly 
165 170 175 



Asn Glu Leu Asn Asn Gin Phe Ala Gly Ala Thr Ala Thr Ala Ser Gly 
180 185 190 



Phe Gly Leu Ala Arg Asp Gly Gly Val He Asp Gly Asn Leu Arg His 
195 200 205 



Val Asn Leu Pro Val He Thr Asn Ala Val Cys Ser Gin Ser Phe Pro 
210 215 220 



Gly Leu He Gin Ala Ser Asn Val Cys Thr Ser Gly Ala Asn Gly Arg 
225 230 235 240 



Ser Thr Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Val Asn Ser Asn 
245 250 255 



Asn Arg Arg He Leu He Gly Val Thr Ser Phe Gly Ser Ala Arg Gly 
260 265 270 
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Cys Gin Val Gly Ser Pro Ala Ala Phe Ala Arg Val Ser Ser Tyr He 
275 280 285 



Ser Trp He Asn Gin Arg Leu 
290 295 



<210> 52 
<211> 234 
<212> PRT 

<213> H. punctigera 
<400> 52 

He Val Gly Gly Ser Ser Ala Ser Leu Gly Gin Phe Pro Tyr Gin Ala 
15 10 15 



Gly Leu Leu He Asn Leu Pro Leu Gly Gin Ser Val Cys Gly Gly Ser 
20 25 30 



Leu Leu Asn Gin Arg Arg Val Leu Thr Ala Ala His Cys Trp Phe Asp 
35 40 45 



Gly Arg Asn Gin Ala Thr Ser Leu Thr Val He Leu Gly Ser He Asn 
50 55 60 



Leu Phe Phe Gly Gly Thr Arg Leu Asn Ser Asn Ser Val Val Met Gin 
65 70 75 80 



Gly Ser Trp Asn Pro Asn Leu He Arg Asn Asp He Ala He He Asn 
85 90 95 
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Leu Pro Ser Asn Val Gly Thr Ser Gly Asn He Ala Pro He Ala Leu 
100 105 HO 



Pro Ser Gly Asn Glu Leu Asn Asn Gin Phe Ala Gly Phe Thr Ala Thr 
115 120 125 



Ala Ser Gly Phe Gly Leu Thr Arg Asp Gly Gly Asn Val Ser Pro Thr 
130 135 140 



Leu Asn His Val Asn Leu Pro Val He Thr Asn Asn Val Cys Trp Gin 
145 150 155 160 



Ser Phe Pro Leu Tyr He Gin Ser Thr Asn He Cys Thr Ser Gly Ala 
165 170 175 



Asn Gly Arg Gly Thr Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Val 
180 185 190 



Thr Ser Asn Asn Arg Arg He Leu He Gly Val Thr Ser Phe Gly Ser 
195 200 205 



Asp Arg Gly Cys Gin Val Gly Ala Pro Ala Ala Phe Ala Arg Val Thr 
210 215 220 



Ser Tyr He Ser Trp He Asn Gin Arg Leu 
225 230 
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<210> 53 

<211> 296 

<212> PRT 

<213> H. punctigera 

<400> 53 

Met Ala Ala Ala Tyr Leu Leu Gly Leu Leu Phe Val Leu Gly Tyr Val 
1 5 10 15 



Gin Gly Gly Leu Leu Asn Ala Asp Pro Ala He He Glu Asp Leu Arg 
20 25 30 



Asp Ala Glu Phe Ser Ser Gly Ser Arg He Val Ala Gly Trp Pro Ala 
35 40 45 



Val Glu Gly Gin He Pro Tyr Gin Gly Ser Leu Arg Met Val Ser Ala 
50 55 60 



He Gly Gly Val Ser Ser Cys Gly Cys Ser Leu He His Asn Lys Trp 
65 70 75 80 



Val Leu Thr Ala Ala His Cys Leu Ala Asn Arg He Thr Phe Val Val 
85 90 95 



Arg Phe Gly Leu Thr Asn Leu Thr Arg Pro Glu He Leu Val Glu Ser 
100 105 HO 



Thr Asn Lys Tyr He His Pro Glu Tyr Asp Glu He Arg Ala Gly Val 
115 120 125 
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Gln Thr Ala Asp Leu Ala Leu Val Gly Leu Asp His Glu He Glu Tyr 
130 135 140 



Ser Ala Asn Val Gin Pro Ser Arg Leu Met Ser Ser Ala Gin Lys Asn 
145 150 155 160 



He Asn Tyr Glu Gly He Gin Met He Val Ser Gly Phe Gly Arg Thr 
165 170 175 



Asp Asp Leu Trp Asn Gly Gly Ala Ala Ser Glu He Leu Leu Trp Val 
180 185 190 



Tyr Gin Arg Gly Val Ser Asn Glu Glu Cys Leu Arg Trp Tyr Pro Thr 
195 200 205 



Ser Gin Val lie Lys Glu Gin Thr He Cys Ala Gly Tyr Trp Asp Asn 
210 215 220 



Pro Ser Gin Ser Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Thr He 
225 230 235 240 



He Asp Ala Asp Gly Glu Arg Thr Gin Ser Arg Tyr Cys Glu Leu Arg 
245 250 255 



He His Cys Trp Asn Ala Ala His Ser Pro Gin Gly Tyr Val Arg Pro 
260 265 270 
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Gly His Tyr His Asp Trp Phe Thr Glu Val Thr Gly He Asn Phe Asp 
275 280 285 



Trp Asp Ser Asp Ala He He Pro 
290 295 



<210> 54 

<211> 365 

<212> PRT 

<213> H. punctigera 

<400> 54 

Met Ala Ala Ala Tyr Leu Leu Gly Leu Leu Phe Val Leu Gly Tyr Val 
15 10 15 



Gin Gly Gly Leu Leu Asn Ala Asp Pro Ala He He Glu Asp Leu Arg 
20 25 30 



Asp Ala Glu Phe Ser Ser Phe Ser Arg He Val Ala Gly Trp Pro Ala 
35 40 45 



Val Glu Gly Gin He Pro Tyr Gin Gly Ser Leu Arg Met Val Ser Ala 
50 55 60 



He Gly Gly Val Ser Ser Cys Gly Cys Ser Leu He His Asn Lys Trp 
65 70 75 80 



Val Leu Thr Ala Ala His Cys Leu Ala Asn Arg He Thr Phe Val Val 
85 90 95 
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Arg Phe Gly Leu Thr Asn Leu Thr Arg Pro Glu He Leu Val Glu Ser 
100 105 HO 



Thr Asn Lys Tyr He His Pro Glu Tyr Asp Glu He Arg Ala Gly Val 
115 120 125 



Gin Thr Ala Asp Leu Ala Leu Val Gly Leu Asp Gin Glu He Glu Tyr 
130 135 140 



Ser Ala Asn Val Gin Pro Ser Arg Leu Met Ser Ser Ala Gin Lys Asn 
145 150 155 160 



He Asn Tyr Glu Gly He Gin Met He Val Ser Gly Phe Gly Arg Thr 
165 170 175 



Asp Asp Leu Trp Asn Gly Gly Ala Ala Ser Glu He Leu Leu Trp Val 
180 185 190 



Tyr Gin Arg Gly Val Ser Asn Glu Glu Cys Leu Arg Trp Tyr Pro Thr 
195 200 205 



Ser Gin Val He Lys Glu Gin Thr He Cys Ala Gly Tyr Trp Asp Asn 
210 215 220 



Pro Ser Gin Ser Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Thr He 
225 230 235 240 
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Ile Asp Ala Asp Gly Glu Arg Thr Gin Val Gly He Val Ser Ser Asp 
245 250 255 



Pro Leu Leu Asp Ala Thr Val His Ser Pro Arg Val Thr Ser Pro Gly 
260 265 270 



His Tyr His Asp Gly His Arg Gly Asp Arg His Gin Leu Arg Leu Gly 
275 280 285 



Gin Arg Arg His Tyr Pro Asp Ser Ser Glu Ser Ser Leu Arg Ala Ala 
290 295 300 



He Leu Pro Leu Glu Ser Ser Arg Ala Phe He Arg Arg Asn Gin Ser 
305 310 315 320 



Ser Phe Arg Gly Gly Leu Cys Gin Pro Pro Arg Phe Pro Thr Arg Thr 
325 330 335 



Val Pro Thr His Leu Pro Arg Arg Thr Leu Ala Ala Pro Pro Ser Glu 
340 345 350 



Glu Glu Glu Ala Pro Glu Lys Val Arg Val Val Glu Tyr 
355 360 365 
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<210> 55 
<211> 36 
<212> PRT 

<213> H. punctigera 
<400> 55 

He Val Gly Gly Ser Leu Ser Ser Val Gly Gin He Pro Tyr Gin Ala 
15 10 15 

Gly Leu Val He Asp Leu Ala Gly Gly Gin Ala Val Cys Gly Gly Ser 
20 25 30 



Leu He Ser Ala 
35 



<210> 56 
<211> 30 
<212> PRT 

<213> H. punctigera 
<400> 56 

He Val Gly Gly Ser Thr Ser Ser Val Gly Gin Phe Pro Tyr Gin Ala 
15 10 15 



Gly Leu Leu Ala Ser Phe Ala Gly Gly Gin Ala Val Cys Gly 
20 25 30 
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<210> 57 
<211> 37 
<212> PRT 

<213> H. punctigera 
<400> 57 

lie Val Gly Gly Ser He Thr Asp He Ala Asn Val Pro Tyr Gin Ala 
15 10 15 

Gly Leu Val He Thr He Phe He Phe Gin Ser Val Cys Gly Ala Ser 
20 25 30 



Leu He Ser His Asn 
35 



<210> 58 
<211> 37 
<212> PRT 

<213> H. punctigera 
<400> 58 

He Val Gly Gly Ser He Thr Asn He Ala Asn Val Pro Tyr Gin Ala 
15 10 15 

Gly Leu Val He Thr He Phe He Phe Gin Ser Val Cys Gly Ala Ser 
20 25 30 



Leu He Ser His Asn 
35 
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<210> 59 
<211> 37 
<212> PRT 

<213> H. punctigera 
<400> 59 

He Val Gly Gly Ser Thr Ser Ser Leu Gly Ala Phe Pro Tyr Gin Ala 
15 10 15 



Gly Leu Leu Ala Ser Phe Ala Ser Gly Gin Gly Val Cys Gly Gly Ser 
20 25 30 

I 

Leu Leu Asn Val Arg 
35 



<210> 60 
<211> 37 
<212> PRT 

<213> H. punctigera 
<400> 60 

He Val Gly Gly Ser Thr Ser Ser Leu Gly Ala Phe Pro Tyr Gin Ala 
15 10 15 

Gly Leu Leu Ala Asn Phe Ala Ser Gly Gin Gly Val Cys Gly Gly Ser 
20 25 30 



Leu Leu Asn Gin Arg 
35 
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<210> 61 
<211> 37 
<212> PRT 

<213> H. punctigera 
<400> 61 

lie Val Gly Gly Ser Ser Ala Ser Leu Gly Gin Phe Pro Tyr Gin Ala 
15 10 15 

Gly Leu Ser Leu He Tyr Ser Gly Gin Ser Val Cys Gly Gly Ser Leu 
20 25 30 



Leu Asn Gin Arg Arg 
35 



<210> 62 
<211> 37 
<212> PRT 

<213> H. punctigera 
<400> 62 

He Val Ala Gly Trp Pro Ala Val Glu Gly Gin He Pro Tyr Gin Gly 
15 10 15 

Ser Leu Arg Met Val Ser Ala He Gly Gly Val Ser Ser Cys Gly Cys 
20 25 30 



Ser Leu He His Asn 
35 
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<210> 63 
<211> 235 
<212> PRT 

<213> H, punctigera 
<400> 63 

He Val Gly Gly Ser He Thr Asp He Ala Asn Val Pro Tyr Gin Ala 
15 10 15 



Gly Leu Val He Thr He Phe He Phe Gin Ser Val Cys Gly Ala Ser 
20 25 30 



Leu He Ser His Asn Arg Leu Val Thr Ala Ala His Cys Lys Ser Asp 
35 40 45 



Gly Val Leu Thr Ala Asn Ser Phe Thr Val Val Leu Gly Ser Asn Thr 
50 55 60 



Leu Phe Phe Gly Gly Thr Arg He Asn Thr Asn Asp Val Val Met His 
65 70 75 80 



Pro Asn Trp Asn Pro Ser Thr Ala Ala Asn Asp He Ala Val Met Arg 
85 90 95 



He Ser Ser Val Ser Phe Ser Asn Val He Gin Pro He Ala Leu Pro 
100 105 HO 



Ser Gly Asp Glu Leu Asn Asn Leu Phe Val Gly Ala Asn Ala Leu Ala 
115 120 125 
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Phe Gly Phe Gly Arg Thr Ser Asp Gly Gly Ser He Gly Ser Asn Gin 
130 135 140 



Gin Val Ser Ser Val Thr He Pro Val He Thr Asn Asp Glu Cys Ala 
145 150 155 160 



Ala Val Tyr Gly Ser Ala Phe Val His Ser Ser Asn He Cys Thr Ser 
165 170 175 



Gly Ala Gly Gly Lys Gly Thr Cys Asn Gly Asp Ser Gly Gly Pro Leu 
180 185 190 



Ala He Asp Ser Asn Asn Glu Lys He Leu He Gly Val Thr Ser Tyr 
195 200 205 



Gly Ala Gin Ala Gly Cys Ala Ala Gly Leu Pro Ala Ala Phe Ala Arg 
210 215 220 



Val Thr Ser Phe Val Ser Trp Val Gin Ser Gin 
225 230 235 



<210> 64 

<211> 235 

<212> PRT 

<213> H. punctigera 

<400> 64 



He Val Gly Gly Ser He Thr Asn He Ala Asn Val Pro Tyr Gin Ala 
15 10 15 
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Gly Leu Val lie Thr lie Phe lie Phe Gin Ser Val Cys Gly Ala Ser 
20 25 30 



Leu lie Ser His Asn Arg Leu Val Thr Ala Ala His Cys Lys Phe Asp 
35 40 45 



Gly Val Met Thr Ala Asn Ser Phe Thr Val Val Leu Gly Ser Asn Thr 
50 55 60 



Leu Phe Phe Gly Gly Thr Arg He Asn Thr Asn Asp Val Val Met His 
65 70 75 80 



Pro Asn Trp Asn Pro Ser Thr Val Ala Asn Asp He Ala Val He Arg 
85 90 95 



He Ser Ser He Val Tyr Asn Asn Val He Gin Pro He Ala Leu Pro 
100 105 110 



Ser Gly Asp Glu Leu Asp Asn Leu Phe Val Gly Ala Asn Ala Leu Ala 
115 120 125 



Ser Gly Phe Gly Arg Thr Ser Asp Ser Gly Gly He Gly Thr Asn Gin 
130 135 140 



Gin Leu Ser Ser Val Thr He Pro Val He Thr Asn Ala Glu Cys Ala 
145 150 155 160 
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Ala Val Tyr Gly Pro Ala Phe Val His Asp Thr Asn lie Cys Thr Ser 
165 170 175 



Gly Ala Gly Gly Lys Gly Thr Cys Asn Gly Asp Ser Gly Gly Pro Leu 
180 185 190 



Ala Val Asp Ser Asn Asp Lys Lys He Leu He Gly Val Thr Ser Tyr 
195 200 205 



Gly Ala Ala Asp Gly Cys Ala Ala Gly Phe Pro Ala Ala Phe Ala Arg 
210 215 220 



Val Thr Ser Phe Val Ser Trp Val Gin Ser Gin 
225 230 235 



<210> 65 
<211> 234 
<212> PRT 

<213> H. punctigera 
<400> 65 

He Val Gly Gly Ser Thr Ser Ser Leu Gly Ala Phe Pro Tyr Gin Ala 
15 10 15 



Gly Leu Leu Ala Ser Phe Ala Ser Gly Gin Gly Val Cys Gly Gly Ser 
20 25 30 



Leu Leu Asn Val Arg Arg Val Leu Thr Ala Ala His Cys Trp Phe Asp 
35 40 45 
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Gly Arg Asn Gin Ala Arg Ser Phe Thr Val Val Leu Gly Ser Val Arg 
50 55 60 



Leu Tyr Ser Gly Gly Thr Arg Leu Asn Thr Ala Ser Val Val Met His 
65 70 75 80 



Gly Ser Trp Asn Pro Asn Leu Val Arg Asn Asp lie Ala Met lie Asn 
85 90 95 



Leu Pro Ser Asn Val Ala Thr Ser Gly Asn lie Ala Pro He Ala Leu 
100 105 110 



Pro Ser Gly Asn Glu Leu Asn Asn Gin Phe Ala Gly Ala Thr Ala Thr 
115 120 125 



Ala Ser Gly Phe Gly Leu Ala Arg Asp Gly Gly Val He Asp Gly Asn 
130 135 140 



Leu Arg His Val Asn Leu Pro Val He Thr Asn Ala Val Cys Ser Gin 
145 150 155 160 



Ser Phe Pro Gly Leu He Gin Ala Ser Asn Val Cys Thr Ser Gly Ala 
165 170 175 



Asn Gly Arg Ser Thr Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Val 
180 185 190 
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Asn Ser Asn Asn Arg Arg He Leu He Gly Val Thr Ser Phe Gly Ser 
195 200 205 



Ala Arg Gly Cys Gin Val Gly Ser Pro Ala Ala Phe Ala Arg Val Ser 
210 215 220 



Ser Tyr He Ser Trp He Asn Gin Arg Leu 
225 230 



<210> 66 
<211> 234 
<212> PRT 

<213> H. punctigera 
<400> 66 

He Val Gly Gly Ser Thr Ser Ser Leu Gly Ala Phe Pro Tyr Gin Ala 
15 10 15 



Gly Leu Leu Ala Asn Phe Ala Ser Gly Gin Gly Val Cys Gly Gly Ser 
20 25 30 



Leu Leu Asn Gin Arg Arg Val Leu Thr Ala Ala His Cys Trp Phe Asp 
35 40 45 



Gly Arg Asn Gin Ala Arg Ser Phe Thr Val Val Leu Gly Ser Val Arg 
50 55 60 



Leu Phe Ser Gly Gly Thr Arg Leu Asp Thr. Ala Ser Val Val Met His 
65 70 75 80 
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Gly Ser Trp Asn Pro Asn Leu lie Arg Asn Asp lie Ala Met lie Asn 
85 90 95 



Leu Pro Ser Asn Val Ala Thr Ser Gly Asn lie Ala Pro lie Ala Leu 
100 105 110 



Pro Ser Gly Asn Glu Leu Asn Asn Asn Phe Asn Gly Ala Thr Ala Thr 
115 120 125 



Ala Ser Gly Phe Gly Leu Ala Arg Asp Gly Gly Ser Val Asp Gly Asn 
130 135 140 



Leu Arg His Val Asn Leu Pro Val He Thr Asn Ala Val Cys Thr Val 
145 150 155 160 



Ser Phe Pro Gly He He Gin Ser Ser Asn He Cys Thr Ser Gly Ala 
165 170 175 



Asn Gly Arg Ser Thr Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Val 
180 185 190 



Asn Ser Asn Asn Arg Arg He Leu He Gly Val Thr Ser Phe Gly Ser 
195 200 205 



Ala Arg Gly Cys Gin Val Gly Ser Pro Ala Ala Phe Ala Arg Val Thr 
210 215 220 
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Ser Phe lie Ser Trp lie Asn Gin Arg Leu 
225 230 



<210> 67 
<211> 282 
<212> PRT 

<213> H. punctigera 
<400> 67 

He Val Gly Gly Ser Ser Ala Ser Leu Gly Gin Phe Pro Tyr Gin Ala 
15 10 15 



Gly Leu Ser Leu. He Tyr Ser Gly Gin Ser Val Cys Gly Gly Ser Leu 
20 25 30 



Leu Asn Gin Arg Arg Val Leu Thr Ala Ala His Cys Trp Phe Asp Gly 
35 40 45 



He Val Ala Gly Trp Pro Ala Val Glu Gly Gin He Pro Tyr Gin Gly 
50 55 60 



Ser Leu Arg Met Val Ser Ala He Gly Gly Val Ser Ser Cys Gly Cys 
65 70 75 80 



Ser Leu He His Asn Lys Trp Val Leu Thr Ala Ala His Cys Leu Ala 
85 90 95 



Asn Arg Asn Gin Ala Thr Ser Leu Thr Val He Leu Gly Ser He Asn 
100 105 110 
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Leu Phe Phe Gly Gly Thr Arg Leu Asn Ser Asn Ser Val Val Met His 
115 120 125 



Gly Ser Trp Asn Pro Asn Leu lie Arg Asn Asp He Ala He He Asn 
130 135 140 



Leu Pro Ser Asn Val Gly Thr Ser Gly Asn He Ala Pro He Ala Leu 
145 150 155 160 



Pro Ser Gly Asn Glu Leu Asn Asn Gin Phe Ala Gly Phe Thr Ala Thr 
165 170 175 



Ala Ser Gly Phe Gly Leu Thr Arg Asp Gly Gly Asn Val Ser Pro Thr 
180 185 190 



Leu Asn His Val Asn Leu Pro Val He Thr Asn Asn Val Cys Trp Gin 
195 200 205 



Ser Phe Pro Leu Tyr He Gin Ser Thr Asn He Cys Thr Ser Gly Ala 
210 215 220 



Asn Gly Arg Gly Thr Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Val 
225 230 235 240 



Thr Ser Asn Asn Arg Arg He Leu He Gly Val Thr Ser Phe Gly Ser 
245 250 255 
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Asp Arg Gly Cys Gin Val Gly Ala Pro Ala Ala Phe Ala Arg Val Thr 
260 265 270 



Ser Tyr lie Ser Trp lie Asn Gin Arg Leu 
275 280 



<210> 68 
<211> 256 
<212> PRT 

<213> H. punctigera 
<400> 68 

He Val Ala Gly Trp Pro Ala Val Glu Gly Gin He Pro Tyr Gin Gly 
15 10 15 



Ser Leu Arg Met Val Ser Ala He Gly Gly Val Ser Ser Cys Gly Cys 
20 25 30 



Ser Leu He His Asn Lys Trp Val Leu Thr Ala Ala His Cys Leu Ala 
35 40 45 



Asn Arg He Thr Phe Val Val Arg Phe Gly Leu Thr Asn Leu Thr Arg 
50 55 60 



Pro Glu He Leu Val Glu Ser Thr Asn Lys Tyr He His Pro Glu Tyr 
65 70 75 80 



Asp Glu He Arg Ala Gly Val Gin Thr Ala Asp Leu Ala Leu Val Gly 
85 90 95 
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Leu Asp His Glu lie Glu Tyr Ser Ala Asn Val Gin Pro Ser Arg Leu 
100 105 110 



Met Ser Ser Ala Gin Lys Asn He Asn Tyr Glu Gly He Gin Met He 
115 120 125 



Val Ser Gly Phe Gly Arg Thr Asp Asp Leu Trp Asn Gly Gly Ala Ala 
130 135 140 



Ser Glu He Leu Leu Trp Val Tyr Gin Arg Gly Val Ser Asn Glu Glu 
145 150 155 160 



Cys Leu Arg Trp Tyr Pro Thr Ser Gin Val He Lys Glu Gin Thr He 
165 170 175 



Cys Ala Gly Tyr Trp Asp Asn Pro Ser Gin Ser Ser Cys Gin Gly Asp 
180 185 190 



Ser Gly Gly Pro Leu Thr He He Asp Ala Asp Gly Glu Arg Thr Gin 
195 200 205 



Ser Arg Tyr Cys Glu Leu Arg He His Cys Trp Asn Ala Thr Ala His 
210 215 220 



Ser Pro Gin Gly Tyr Val Arg Pro Gly His Tyr His Asp Trp Phe Thr 
225 230 235 240 
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Glu Val Thr Gly lie Asn Phe Asp Trp Asp Ser Asp Ala lie lie Pro 
245 250 255 



<210> 69 
<211> 236 
<212> PRT 

<213> H. punctigera 
<400> 69 

He Val Gly Gly Ser Leu Ser Ser Val Gly Gin He Pro Tyr Gin Ala 
15 10 15 



Gly Leu Val He Asp Leu Ala Gly Gly Gin Ala Val Cys Gly Gly Ser 
20 25 30 



Leu He Ser Ala Ser Arg Val Leu Thr Ala Ala His Cys Trp Phe Asp 
35 40 45 



Gly Gin Asn Gin Ala Trp Arg Phe Thr Val Val Leu Gly Ser Thr Thr 
50 55 60 



Leu Phe Ser Gly Gly Thr Arg He Pro Thr Ser Asn Val Val Met His 
65 70 75 80 



Gly Ser Trp Thr Pro Ser Leu He Arg Asn Asp Val Ala Val He Arg 
85 90 95 



Leu Gly Thr Asn Val Ala Thr Ser Asn Thr He Ala He He Ala Leu 
100 105 110 



WO 2004/094630 



PCT/AU2004/000524 



-72- 



Pro Ser Gly Ser Gin lie Asn Glu Asn Phe Ala Gly Glu Thr Ala Leu 
115 120 125 



Ala Ser Gly Phe Gly Leu Thr Ser Asp Thr Gly Ser He Ser Ser Asn 
130 135 140 



Gin Ala Leu Ser His Val Asn Leu Pro Val He Thr Asn Ala Val Cys 
145 150 155 160 



Arg Asn Ser Phe Pro Leu Leu He Gin Asp Ser Asn He Cys Thr Ser 
165 170 175 



Gly Ala Asn Gly Arg Ser Thr Cys Arg Gly Asp Ser Gly Gly Pro Leu 
180 185 190 



Val Val Thr Arg Asn Asn Arg Pro Leu Leu He Gly He Thr Ser Phe 
195 200 205 



Gly Ser Ala Arg Gly Cys Gin Val Gly Ser Pro Ala Ala Phe Ala Arg 
210 215 220 



Val Thr Ser Tyr He Ser Trp He Asn Gly Gin Leu 
225 230 235 
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<210> 70 
<211> 224 
<212> PRT 

<213> H, punctigera 
<400> 70 

He Val Gly Gly Tyr Thr Cys Glu Glu Asn Ser Leu Pro Tyr Gin Val 
15 10 15 



Ser Leu Asn Ser Gly Ser His Phe Cys Gly Gly Ser Leu He Ser Glu 
20 25 30 



Gin Trp Val Val Ser Ala Ala His Cys Tyr Lys Thr Arg He Gin Val 
35 40 45 



Arg Leu Gly Glu His Asn He Lys Val Leu Glu Gly Asn Glu Gin Phe 
50 55 60 



He Asn Ala Ala Lys He He Arg His Pro Lys Tyr Asn Arg Asp Thr 
65 70 75 80 



Leu Asp Asn Asp He Met Leu He Lys Leu Ser Ser Pro Ala Val He 
85 90 95 



Asn Ala Arg Val Ser Thr He Ser Leu Pro Thr Ala Pro Pro Ala Ala 
100 105 110 



Gly Thr Glu Cys Leu He Ser Gly Trp Gly Asn Thr Leu Ser Phe Gly 
115 120 125 
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Ala Asp Tyr Pro Asp Glu Leu Lys Cys Leu Asp Ala Pro Val Leu Thr 
130 135 140 



Gin Ala Glu Cys Lys Ala Ser Tyr Pro Gly Lys He Thr Asn Ser Met 
145 150 155 160 



Phe Cys Val Gly Phe Leu Glu Gly Gly Lys Asp Ser Cys Gin Arg Asp 
165 170 175 



Ser Gly Gly Pro Val Val Cys Asn Gly Gin Leu Gin Gly Val Val Ser 
180 185 190 



Trp Gly His Gly Cys Ala Trp Lys Asn Arg Pro Gly Val Tyr Thr Lys 
195 200 205 



Val Tyr Asn Tyr Val Asp Trp He Lys Asp Thr He Ala Ala Asn Ser 
210 215 220 



<210> 71 

<211> 275 

<212> PRT 

<213> H. armigera 

<400> 71 

Val His Leu Glu Asp Ser He Asp Leu Glu Asp He Thr Ala Trp Gly 
15 10 15 



Tyr Leu Thr Lys Phe Gly He Pro Glu Ala Glu Lys He Arg Asn Ala 
20 25 30 
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Glu Glu Ala Ser Ser Ala Ser Arg He Val Gly Gly Ser Leu Ser Ser 
35 40 45 



Leu Gly Gin He Pro Tyr Gin Ala Gly Leu Val He Asp Leu Ser Gly 
50 55 60 



Gly Gin Ala Val Cys Gly Gly Ser Leu He Ser Ala Ser Arg Val Leu 
65 70 75 80 



Thr Ala Ala His Cys Trp Phe Asp Gly Gin Asn Gin Ala Trp Arg Phe 
85 90 95 



Thr Val Val Leu Gly Ser Thr Thr Leu Phe Ser Gly Gly Thr Arg He 
100 105 HO 



Ala Thr Ser Asn Val Val Met His Gly Ser Trp Thr Pro Ser Leu He 
115 120 125 



Arg Asn Asp Val Ala Val He Arg Leu Gly Thr Asn Val Gly Thr Ser 
130 135 140 



Asn Thr He Ala He He Ala Leu Pro Ser Gly Ser Gin He Asn Glu 
145 150 155 160 



Asn Phe Ala Gly Glu Thr Ala Leu Ala Ser Gly Phe Gly Leu Thr Ser 
165 170 175 
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Asp Ser Gly Ser He Ser Ser Asn Gin Ala Leu Ser His Val Asn Leu 
180 185 190 



Pro Val He Thr Asn Ala Val Cys Arg Ser Ser Phe Pro Leu Leu He 
195 200 205 



Gin Asp Ser Asn He Cys Thr Ser Gly Ala Asn Gly Arg Ser Thr Cys 
210 215 220 



Arg Gly Asp Ser Gly Gly Pro Leu Val Val Thr Arg Asn Ser Arg Pro 
225 230 235 240 



Leu Leu He Gly He Thr Ser Phe Gly Ser Ala Arg Gly Cys Gin Val 
245 250 255 



Gly Ser Pro Ala Ala Phe Ala Arg Val Thr Ser Tyr He Ser Trp He 
260 265 270 



Asn Gly Gin 
275 

<210> 72 
<211> 275 
<212> PRT 

<213> H. punctigera 
<400> 72 

Val His Leu Glu Asp Ser He Asp Leu Glu Asp He Thr Ala Trp Gly 
15 10 15 
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Tyr Leu Thr Lys Phe Gly He Pro Glu Ala Glu Lys He Arg Asn Ala 
20 25 30 



Glu Glu Ala Ser Ser Ala Ser Arg He Val Gly Gly Ser Leu Ser Ser 
35 40 45 



Leu Gly Gin He Pro Tyr Gin Ala Gly Leu Val He Asp Leu Ala Gly 
50 55 60 



Gly Gin Ala Val Cys Gly Gly Ser Leu He Ser Ala Ser Arg Val Leu 
65 70 75 80 



Thr Ala Ala His Cys Trp Phe Asp Gly Gin Asn Gin Ala Trp Arg Phe 
85 90 95 



Thr Val Val Leu Gly Ser Thr Thr Leu Phe Ser Gly Gly Thr Arg He 
100 105 HO 



Pro Thr Ser Asn Val Val Met His Gly Ser Trp Thr Pro Ser Leu He 
115 120 125 



Arg Asn Asp Val Ala Val He Arg Leu Gly Thr Asn Val Gly Thr Ser 
130 135 140 



Asn Thr He Ala He He Ala Leu Pro Ser Gly Ser Gin He Asn Glu 
145 150 155 160 
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Asn Phe Ala Gly Glu Thr Ala Leu Ala Ser Gly Phe Gly Leu Thr Ser 
165 170 175 



Asp Thr Gly Ser lie Ser Ser Asn Gin Ala Leu Ser His Val Asn Leu 
180 185 190 



Pro Val lie Thr Asn Ala Val Cys Arg Asn Ser Phe Pro Leu Leu lie 
195 200 205 



Gin Asp Ser Asn He Cys Thr Ser Gly Ala Asn Gly Arg Ser Thr Cys 
210 215 220 



Arg Gly Asp Ser Gly Gly Pro Leu Val Val Thr Arg Asn Asn Arg Pro 
225 230 235 240 



Leu Leu He Gly He Thr Ser Phe Gly Ser Ala Arg Gly Cys Gin Val 
245 250 255 



Gly Ser Pro Ala Ala Phe Ala Arg Val Thr Ser Tyr He Ser Trp He 
260 265 270 



Asn Gly Gin 
275 
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<210> 73 
<211> 230 
<212> PRT 

<213> H. punctigera 
<400> 73 

He Val Asn Gly Glu Asp Ala Val Pro Gly Ser Trp Pro Trp Gin Val 
15 10 15 



Ser Leu Gin Asp Ser Thr Gly Phe His Phe Cys Gly Gly Ser Leu He 
20 25 30 



Ser Glu Asp Trp Val Val Thr Ala Ala His Cys Gly Val Thr Thr Ser 
35 40 45 



Asp Val Val Val Ala Gly Glu Phe Asp Gin Gly Ser Ser Ser Glu Lys 
50 55 60 



He Gin Lys Leu Lys He Ala Lys Val Phe Lys Asn Ser Lys Tyr Asn 
65 70 75 80 



Ser Leu Thr He Asn Asn Asp He Thr Leu Leu Lys Leu Ala Thr Pro 
85 90 95 



Ala Gin Phe Ser Glu Thr Val Ser Ala Val Cys Leu Pro Ser Ala Asp 
100 105 HO 



Glu Asp Phe Pro Ala Gly Met Leu Cys Ala Thr Thr Gly Trp Gly Lys 
115 120 125 
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Thr Lys Tyr Asn Ala Leu Lys Thr Pro Asp Lys Leu Gin Gin Ala Thr 
130 135 140 



Leu Pro lie Val Ser Asn Thr Asp Cys Arg Lys Tyr Trp Gly Ser Arg 
145 150 155 160 



Val Thr Asp Val Met He Cys Ala Gly Ala Ser Gly Val Ser Ser Cys 
165 170 175 



Met Gly Asp Ser Gly Gly Pro Leu Val Cys Gin Lys Asn Gly Ala Trp 
180 185 190 



Thr Leu Ala Gly He Val Ser Trp Gly Ser Ser Thr Cys Ser Thr Ser 
195 200 205 



Thr Pro Ala Val Tyr Ala Arg Val Thr Ala Leu Met Pro Trp Val Gin 
210 215 220 



Glu Thr Leu Ala Ala Asn 
225 230 



<210> 74 

<211> 230 

<212> PRT 

<213> H. punctigera 

<400> 74 



He Val Asn Gly Glu Glu Ala Val Pro Gly Ser Trp Pro Trp Gin Val 
15 10 15 
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Ser Leu Gin Asp Lys Thr Gly Phe His Phe Cys Gly Gly Ser Leu lie 
20 25 30 



Asn Glu Asn Trp Val Val Thr Ala Ala His Cys Gly Val Thr Thr Ser 
35 40 45 



Asp Val Val Val Ala Gly Glu Phe Asp Gin Gly Leu Glu Thr Glu Asp 
50 55 60 



Thr Gin Val Leu Lys He Gly Lys Val Phe Lys Asn Pro Lys Phe Ser 
65 70 75 80 



He Leu Thr Val Arg Asn Asp He Thr Leu Leu Lys Leu Ser Thr Ala 
85 90 95 



Ala Ser Phe Ser Gin Thr Val Ser Ala Val Cys Leu Pro Ser Ala Ser 
100 105 HO 



Asp Asp Phe Ala Ala Gly Thr Thr Cys Val. Thr Thr Gly Trp Gly Leu 
115 120 125 



Thr Arg Tyr Thr Asn Ala Asn Thr Pro Asp Arg Leu Gin Gin Ala Ser 
130 135 140 



Leu Pro Leu Leu Ser Asn Thr Asn Cys Lys Lys Tyr Trp Gly Thr Lys 
145 150 155 160 
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Ile Lys Asp Ala Met He Cys Ala Gly Ala Ser Gly Val Ser Ser Cys 
165 170 175 



Met Gly Asp Ser Gly Gly Pro Leu Val Cys Lys Gin Asn Gly Ala Trp 
180 185 190 



Thr Leu Val Gly He Val Ser Trp Gly Ser Ser Thr Cys Ser Thr Ser 
195 200 205 



Thr Pro Gly Val Tyr Ala Arg Val Thr Ala Leu Val Asn Trp Val Gin 
210 215 220 



Gin Thr Leu Ala Ala Asn 
225 230 



<210> 75 
<211> 237 
<212> PRT 

<213> H. punctigera 
<400> 75 

He Val Gly Gly Ser Thr Ser Ser Leu Gly Ala Phe Pro Tyr Gin Ala 
15 10 15 



Gly Leu Leu Ala Ser Phe Ala Ser Gly Gin Gly Val Cys Gly Gly Ser 
20 25 30 



Leu Leu Asn Val Arg Arg Val Leu Thr Ala Ala His Cys Trp Phe Asp 
35 40 45 
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Gly Arg Asn Gin Ala Arg Ser Phe Thr Val Val Leu Gly Ser Val Arg 
50 55 60 



Leu Tyr Ser Gly Gly Thr Arg Leu Asn Thr Ala Ser Val Val Met His 
65 70 75 80 



Gly Ser Trp Asn Pro Asn Leu Val Arg Thr He Asn Asn Asp He Ala 
85 90 95 



Met He Asn Leu Pro Ser Asn Val Ala Thr Ser Gly Asn He Ala Pro 
100 105 HO 



He Ala Leu Pro Ser Gly Asn Glu Leu Asn Asn qin Phe Ala Gly Ala 
115 120 125 



Thr Ala Thr Ala Ser Gly Phe Gly Leu Ala Arg Asp Gly Gly Val He 
130 135 140 



Asp Gly Asn Leu Arg His Val Asn Leu Pro Val He Thr Asn Ala Val 
145 150 155 160 



Cys Ser Gin Ser Phe Pro Gly Leu He Gin Ala Ser Asn Val Cys Thr 
165 170 175 



Ser Gly Ala Asn Gly Arg Ser Thr Cys Gin Gly Gly Asp Ser Gly Gly 
180 185 190 
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Pro Leu Val Asn Ser Asn Asn Arg Arg He Leu He Gly Val Thr Ser 
195 200 205 



Phe' Gly Ser Ala Arg Gly Cys Gin Val Gly Ser Pro Ala Ala Phe Ala 
210 215 220 



Arg Val Ser Ser Tyr He Ser Trp He Asn Gin Arg Leu 
225 230 235 



<210> 76 
<211> 236 
<212> PRT 

<213> H. punctigera 
<400> 76 

He Val Gly Gly Ser Leu Ser Ser Val Gly Gin He Pro Tyr Gin Ala 
15 10 15 



Gly Leu Val He Asp Leu Ala Gly Gly Gin Ala Val Cys Gly Gly Ser 
20 25 30 



Leu Leu Ser Ala Ser Arg Val Leu Thr Ala Ala His Cys Trp Phe Asp 
35 40 45 



Gly Gin Asn Gin Ala Trp Arg Phe Thr Val Val Leu Gly Ser Thr Thr 
50 55 60 



Leu Phe Ser Gly Gly Thr Arg Leu Asn He Pro Ser Ser Asn Met His 
65 70 75 80 
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Gly Ser Trp Asn Pro Ser Leu lie Arg Asn Asp Val Ala Val lie Arg 
85 90 95 



Leu Gly Thr Asn Val Ala Thr Ser Asn Thr He Ala He He Ala Leu 
100 105 110 



Pro Ser Gly Ser Gin He Asn Glu Asn Phe Ala Gly Glu Thr Ala Leu 
115 120 125 



Ala Ser Gly Phe Gly Leu Thr Ser Tyr Thr Gly Ser He Ser Ser Asn 
130 135 140 



Gin Ala Leu Ser His Val Asn Leu Pro Val He Thr Asn Ala Val Cys 
145 150 155 160 



Arg Asn Ser Phe Ser Leu Leu He Gin Asp Ser Asn He Cys Thr Ser 
165 170 175 



Gly Ala Asn Gly Arg Ser Thr Cys Arg Gly Asp Ser Gly Gly Pro Leu 
180 185 190 



Val Val Thr Arg Asn Asn Arg Pro Leu Leu He Gly Val Thr Ser Phe 
195 200 205 



Gly Ser Ala Arg Gly Cys Gin Val Gly Ser Pro Ala Ala Phe Ala Arg 
210 215 220 
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Val Thr Ser Tyr He Ser Trp He Asn Gly Gin Leu 
225 230 235 



<210> 77 

<211> 107 

<212> PRT 

<213> peptide 

<4QQ> 77 

Met Glu Ser Lys Phe Ala His He He Val Phe Phe Leu Leu Ala Thr 
15 10 15 



Ser Phe Glu Thr Leu Met Ala Arg Lys Glu Ser Asp Gly Pro Glu Val 
20 25 30 



He Glu Leu Leu Lys Glu Phe Glu Cys Asn Gly Lys Gin Phe Trp Pro 
35 40 45 



Glu Leu He Gly Val Pro Thr Lys Leu Ala Lys Glu He He Glu Lys 
50 55 60 



Glu Asn Ser Leu He Asn Asn Val Gin He Leu Leu Asn Gly Ser Pro 
65 70 75 80 



Val Thr Met Asp Tyr Arg Cys Asn Arg Val Arg Leu Phe Asp Asn He 
85 90 95 



Leu Gly Ser Val Val Gin He Pro Arg Val Ala 
100 105 
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<210> 78 

<211> 107 

<212> PRT 

<213> peptide 

<400> 78 

Met Glu Ser Lys Phe Ala His He He Val Phe Phe Leu Leu Ala Thr 
15 10 15 



Ser Phe Glu Thr Leu Leu Ala Arg Lys Glu Ser Asp Gly Pro Glu Val 
20 25 30 



He Glu Leu Leu Lys Glu Phe Glu Cys Asn Gly Lys Gin Phe Trp Pro 
35 40 45 



Glu Leu He Gly Val Pro Thr Lys Leu Ala Lys Glu He He Glu Lys 

.50 55 60 



Glu Asn Ser Leu He Asn Asn Val Gin He Leu Leu Asn Gly Ser Pro 
65 70 75 80 



Val Ala Met Asp Tyr Arg Cys Asn Arg Val Arg Leu Phe Asp Asn He 
85 90 95 



Leu Gly Ser Val Val Gin He Pro Arg Val Ala 
100 105 
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<210> 79 
<211> 71 
<212> PRT 
<213> peptide 

<400> 79 

Lys Glu Phe Glu Cys Asp Gly Lys Leu Gin Trp Pro Glu Leu He Gly 
15 10 15 



Val Pro Thr Lys Leu Ala Lys Glu He He Glu Lys Gin Asn Ser Leu 
20 25 30 



He Ser Asn Val His lie Leu Leu Asn Gly Ser Pro Val Thr Met Asp 
35 40 45 



Phe Arg Cys Asn Arg Val Arg Leu Phe Asp Asp He Leu Gly Ser Val 
50 55 60 



Val Gin He Pro Arg Val Ala 
65 70 



<210> 80 

<211> 106 

<212> PRT 

<213> peptide 

<400> 80 

Met Glu Ser Lys Phe Ala His He He Val Phe Phe Leu Leu Ala Thr 
15 10 15 
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Ser Phe Glu Thr Leu Leu Ala Arg Lys Glu Ser Asp Gly Pro Glu Val 
20 25 30 



He Glu Leu Gin Lys Glu Phe Glu Cys Asn Gly Lys Gin Arg Trp Pro 
35 40 45 



Glu Leu He Gly Val Pro Thr Lys Leu Ala Lys Gly He He Glu Lys 
50 55 60 



Glu Asn Ser Leu He Thr Asn Val Gin He Leu Leu Asn Gly Ser Pro 
65 70 75 80 



Val Thr Met Asp Tyr Arg Ser Asn Arg Val Arg Leu Phe Asp Asn He 
85 90 95 



Leu Gly Asp Val Val Gin He Pro Arg Val 
100 105 



<210> 81 

<211> 111 

<212> PRT 

<213> peptide 

<400> 81 

Met Glu Ser Lys Phe Ala His He He Val Phe Phe Leu Leu Ala Thr 
15 10 15 



Ser Phe Glu Thr Leu Met Ala Arg Lys Glu Gly Asp Gly Ser Glu Val 
20 25 30 
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Ile Lys Leu Leu Lys Glu Ser Glu Ser Glu Ser Trp Cys Lys Gly Lys 
35 40 45 



Gin Phe Trp Pro Glu Leu lie Gly Val Pro Thr Lys Leu Ala Lys Glu 
50 55 60 



He He Glu Lys Glu Asn Pro Ser He Asn Asp Val Pro He He Leu 
65 70 75 8*0 



Asn Gly Thr Pro Val Pro Ala Asp Phe Arg Cys Asn Arg Val Arg Leu 
85 90 95 



Phe Asp Asn He Leu Gly Asp Val Val Gin He Pro Arg Val Ala 
100 105 HO 



<210> 82 

<211> 111 

<212> PRT 

<213> peptide 

<400> 82 

Met Glu Ser Lys Phe Ala His He He Val Phe Phe Leu Leu Ala Thr 
15 10 15 



Ser Phe Glu Thr Leu Met Ala Arg Lys Glu He Asp Gly Pro Glu Val 
20 25 30 



He Glu Leu Leu Lys Glu Phe Asp Ser Asn Leu Met Cys Glu Gly Lys 
35 40 45 
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Gln Met Trp Pro Glu Leu He Gly Val Pro Thr Lys Leu- Ala Lys Glu 
50 55 60 



He He Glu Lys Glu Asn Pro Ser He Thr Asn He Pro He Leu Leu 
65 70 75 80 



Ser Gly Ser Pro He Thr Leu Asp Tyr Leu Cys Asp Arg Val Arg Leu 
85 90 95 



Phe Asp Asn He Leu Gly Phe Val Val Gin Met Pro Val Val Thr 
100 105 HO 



<210> 83 

<211> 107 

<212> PRT 

<213> peptide 

<400> 83 

Met Val Lys Phe Ala His Val Val Ala Phe Leu Leu Leu Ala Ser Leu 
15 10 15 



He Gin Pro Leu Thr Ala Arg Asp Leu Glu He Asn Val Leu Gin Leu 
20 25 30 



Asp Val Ser Gin Ser Gly Cys Pro Gly Val Thr Lys <31u Arg Trp Pro 
35 40 45 
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Glu Leu Leu Gly Thr Pro Ala Lys Phe Ala Met Gin lie lie Gin Lys 
50 55 60 



Glu Asn Pro Lys Leu Thr Asn Val Gin Thr He Leu Asn Gly Gly Pro 
65 70 75 80 



Val Thr Glu Asp Leu Arg Cys Asn Arg Val Arg Leu Phe Val Asn Val 
85 90 95 



Leu Asp Phe He Val Gin Thr Pro Gin He Gly 
100 105 



<210> 84 

<211> 73 

<212> PRT 

<213> peptide 

<400> 84 

Met Ser Ser Thr Glu Cys Gly Gly Gly Gly Gly Gly Ala Lys Thr Ser 
15 10 15 



Trp Pro Glu Val Val Gly Leu Ser Val Glu Asp Ala Lys Lys Val He 
20 25 30 



Leu Lys Asp Lys Pro Asp Ala Asp He Val Val Leu Pro Val Gly Ser 
35 40 45 



Val Val Thr Ala Asp Tyr Arg Pro Asn Arg Val Arg He Phe Val Asp 
50 55 60 
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Ile Val Ala Gin Thr Pro His He Gly 
65 70 



<210> 85 

<211> 70 

<212> PRT 

<213> peptide 

<400> 85 

Thr Glu Phe Gly Ser Glu Leu Lys Ser Phe Pro Glu Val Val Gly Lys 
15 10 15 



Thr Val Asp Gin Ala Arg Glu Tyr Phe Thr Leu His Tyr Pro Gin Tyr 
20 25 30 



Asp Val Tyr Phe Leu Pro Glu Gly Ser Pro Val Thr Leu Asp Leu Arg 
35 40 45 



Tyr Asn Arg Val Arg Val Phe Tyr Asn Pro Gly Thr Asn Val Val Asn 
50 55 60 



His Val Pro His Val Gly 
65 70 
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<210> 86 
<211> 60 
<212> DNA 
<213> peptide 

<400> 86 

ggatccatga aactcttggc tgtgactcta ttggctttcg ccgcggtcgt ctccgcgagg 

<210> 87 

<211> 18 

<212> PRT 

<213> peptide 

<400> 87 

Met Lys Leu Leu Ala Val Thr Leu Leu Ala Phe Ala Ala Val Val Ser 
15 10 15 



Ala Arg 



<210> 88 

<211> 40 

<212> DNA 

<213> artificial sequence 
<220> 

<223> FwBacRECH2 primer 

<400> 88 

ggatccatga aactcttggc tgtgactcta ttggctttcg 
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<210> 89 

<211> 40 

<212> DNA 

<213> artificial sequence 
<220> 

<223> FwBacRECH2 primer 

<400> 89 

ttggctttcg ccgcggtcgt ctccgcgagg aacgggtccc 40 



<210> 90 
<211> 864 
<212> DNA 
<213> peptide 

<400> 90 

aacggatccc accatcacca tcaccatgtt cacctcgagg attctattga tctggaagat 60 

attaccgctt ggggatacct caccaaattg ggtattccag aagctgagaa aatccgcaac 120 

gctgaagaag ctagctctgc tagcaggatc gtcggtggtt cattgtccag tgtcggacag 180 

atcccttacc aggctggtct cgtcattgac ttagcaggtg gccaggctgt ctgcggaggc 240 

tccctgatca gcgcttcccg cgtactgacc gctgctcact gctggttcga cggccaaaac 300 

caggcctgga gattcaccgt tgttcttggt tccaccacct tgttctctgg cggtaccaga 360 

atccctacat ccaatgttgt tatgcacgga agctggactc ctagccttat ccgtaacgat 420 

gttgccgtaa tcagattggg caccaacgta gcaacctcaa acaccattgc catcatcgct 480 

ctacccagcg gcagccagat caacgagaac ttcgccggtg aaaccgccct cgcctccggc 540 

ttcggtctca ccagtgacac cggcagcatc tccagcaacc aggctctgag ccacgtcaac 600 
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ctgccagtga tcaccaacgc tgtgtgcaga aattcattcc ccctgctgat ccaggactct 660 

aacatttgca ccagcggtgc caacggcagg agcacttgcc gcggtgactc cggcggtcct 720 

ctcgtcgtca ccaggaacaa cagaccactc ttgatcggta tcacctcttt cggatctgcc 780 

cgcggttgcc aagttggatc tcccgctgcc ttcgccagag tcacctctta catcagctgg 840 

atcaacggcc agctctaaaa gctt 864 



<210> 91 

<211> 287 

<212> PRT 

<213> peptide 

<400> '91 

Asn Gly Ser His His His His His His Val His Leu Glu Asp Ser lie 
15 10 15 



Asp Leu Glu Asp He Thr Ala Trp Gly Tyr Leu Thr Lys Phe Gly He 
20 25 30 



Pro Glu Ala Glu Lys He Arg Asn Ala Glu Glu Ala Ser Ser Ala Ser 
35 40 45 



Arg He Val Gly Gly Ser Leu Ser Ser Val Gly Gin He Pro Tyr Gin 
50 55 60 



Ala Gly Leu Val He Asp Leu Ala Gly Gly Gin Ala Val Cys Gly Gly 
65 70 75 80 
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Ser Leu He Ser Ala Ser Arg Val Leu Thr Ala Ala His Cys Trp Phe 
85 90 95 



Asp Gly Gin Asn Gin Ala Trp Arg Phe Thr Val Val Leu Gly Ser Thr 
100 105 110 



Thr Ley Phe Ser Gly Gly Thr Arg He Pro Thr Ser Asn Val Val Met 
115 120 125 



His Gly Ser Trp Thr Pro Ser Leu He Arg Asn Asp Val Ala Val He 
130 135 140 



Arg Leu Gly Thr Asn Val Ala Thr Ser Asn Thr He Ala He He Ala 
145 150 155 160 



Leu Pro Ser Gly Ser Gin He Asn Glu Asn Phe Ala Gly Glu Thr Ala 
165 170 175 



Leu Ala Ser Gly Phe Gly Leu Thr Ser Asp Thr Gly Ser He Ser Ser 
180 185 190 



Asn Gin Ala Leu Ser His Val Asn Leu Pro Val He Thr Asn Ala Val 
195 200 205 



Cys Arg Asn Ser Phe Pro Leu Leu He Gin Asp Ser Asn He Cys Thr 
210 215 220 



Ser Gly Ala Asn Gly Arg Ser Thr Cys Arg Gly Asp Ser Gly Gly Pro 
225 230 235 240 
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Leu Val Val Thr Arg Asn Asn Arg Pro Leu Leu lie Gly lie Thr Ser 
245 250 255 

Phe Gly Ser Ala Arg (?ly Cys Gin Val Gly Ser Pro Ala Ala Phe Ala 
260 265 270 

Arg Val Thr Ser Tyr lie Ser Trp He Asn Gly Gin Leu Lys Leu 
275 280 285 



<210> 92 

<211> 25 

<212> DNA 

<213> artificial sequence 
<220> 

<223> RvRECH primer 

<400> 92 

gatcaacggc cagctctaaa agctt 

<210> 93 

<211> 15 

<212> PRT 

<213> peptide 

<400> 93 



He Val Gly Gly Ser Thr Ser Ser Leu Gly Ala Thr Pro Thr Gin 
15 10 15 
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